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The discovery of a Higgs particle is possible in a variety of search channels at the LHC. However, 
the true identity of any putative Higgs boson will, at first, remain ambiguous until one has experi- 
mentally excluded other possible assignments of quantum numbers and couplings. We quantify the 
degree to which one can discriminate a Standard Model Higgs boson from "look-alikes" at, or close 
to, the moment of discovery at the LHC. Wc focus on the fuUy-rcconstructiblc "golden" decay mode 
to a pair of Z bosons and a four-lepton final state. Considering both on-shcU and off-shell Z's, we 
show how to utilize the full decay information from the events, including the distributions and corre- 
lations of the five relevant angular variables. We demonstrate how the finite phase space acceptance 
of any LHC detector sculpts the decay distributions, a feature neglected in previous studies. We 
use likelihood ratios to discriminate a Standard Model Higgs from look-alikes with other spins or 
nonstandard parity, CP, or form factors. For a resonance mass of 200 GeV/c^, we achieve a median 
discrimination significance of 3 a with as few as 19 events, and even better discrimination for the 
off-shell decays of a 145 GeV/c^ resonance. 

PACS numbers: 14.80.Bn, 14.80.Cp 



I. INTRODUCTION 

The CDF and D0 experiments [1] at the Fermilab 
Tevatron are continuously improving their Higgs mass 
limits, and the ATLAS and CMS detectors at the CERN 
LHC are designed to discover [2, 3] the standard Higgs in 
all of the unexplored mass range, up to the high masses 
at which its raison d'etre is lost. While an un-discovery 
would be momentous, we focus here on the possibil- 
ity that evidence resembling the standard expectation is 
found. 

Because the idea is so venerable, one may have grown 
insensitive to how special a Higgs boson would be. Its 
quantum numbers must be those of the vacuum, which 
its field permeates. Its couplings to the electroweak gauge 
bosons and Z are proportional to their masses, as are 
its couplings to quarks and leptons. Any deviation from 
the predicted qiiantum numbers or couplings of a puta- 
tive Higgs boson would have deep ramifications for parti- 
cle physics. An experimental program for Higgs physics 
must be focused on the rigorous determination of these 
fundamental quantities. 

A Higgs boson discovery at the LHC will arise from 

excesses observed in one or more final states. Since the 
couplings and partial widths of a SM Higgs boson are 
predicted as a function of its mass, the size of any excess, 
the width of a reconstructed resonance, or a comparison 
of different channels may immediately give clues as to 
whether the putative new particle is consistent witli a 
SM Higgs boson. Nevertheless, the true identity of the 
new particle will at first remain ambiguous, until one 
has experimentally excluded other possible assignments 
of quantum numbers and couplings. We shall refer to 



these other possibilities as Higgs look-alikes (HLLs). 

The purpose of this paper is to quantify the degree to 
which one can discriminate a Standard Model Higgs bo- 
son from HLLs at, or close to, the moment of discovery 
at the LHC. There is a vast literature about determining 
Higgs properties from signals in a variety of final states 
(for a review, see [4]), but this research mostly addresses 
only the related question of wliethcir it is possible; at all 
to determine Higgs quantum numbers and couplings at a 
hadron collider. The current situation in this respect is 
similar to the LHC experimental program for supcrsym- 
metry, where only recently are there quantitative stud- 
ies of the potential to discriminate supersymmetry look- 
alikes at the moment of discovery [5]- [9]. 

Our study focuses on the so-called "golden channel" 
for Higgs physics, namely the Higgs decay H ZZ* 
^i^i^t^2 ^ where if 2 denotes an electron or a muon, and 
Z* denotes that one of the Zs may be strongly off-shell. 
This channel has the advantage that the kinematics of 
the Higgs and its decay products are fully reconstructible 
from a completely leptonic final state. Approximately 
half of the events will be /x^/i^e+e^, where all four lep- 
tons are easily distinguishable, and even in the 4/z and 4e 
final states all four leptons can be distinguished by the 
requirement that one or both Z bosons are reconstructed 
within an on-shell mass window. A well-measured, four- 
body, closed kinematic final state provides many inde- 
pendent observables for determining properties of the ob- 
served resonance; thus this channel provides more infor- 
mation than e.g. the Higgs decay into two photons, where 
the photon polarizations are not measured. 

The branching fraction for the golden mode is small; 
example values for a SM Higgs ZZ* M arc 0.0011 
for TOH=200 GeV/c2, 0.0014 for m//=350 GeV/c^, and 
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0.00036 for mH=U5 GcV/c^ [10]. Even for favor- 
able Higgs masses, this branching fraction is two or- 
ders of magnitude smaller than that for semileptonic 
H — >• W^W~ — >■ a channel that, though ham- 

pered by large backgrounds, is also fully reconstruct ible 
up to a two-fold ambiguity in the determination of the 
longitudinal neutrino momentum [11, 12]. The golden 
mode branching fraction is also smaller than that for the 
fully leptonic SM Higgs decay H W+W' i+vi'v. 
Nevertheless, for a wide range of SM Higgs masses, this 
mode is a promising discovery channel and would, in any 
event, be populated at or around the time of a putative 
discovery in a different channel. 

We factorize the HLL problem into observables related 
to production and observables related to decay. In this 
paper we perform a systematic analysis including all of 
the information from the putative Higgs decays, leav- 
ing the analysis of Higgs versus HLL production to later 
work. While this factorization of production and decay is 
not completely clean, we show that the resulting model- 
dependent uncertainty introduced into the decay analy- 
sis is small. A full analysis will include production in- 
formation and could produce stronger results than those 
presented here, since large cross section differences are 
expected between SM Higgs production and the produc- 
tion of many Higgs look-alikes. However, including Higgs 
and HLL production also introduces new theoretical and 
measurement uncertainties involving associated hadronic 
jets and the parton distribution functions that describe 
the initial state. 

One advantage of focusing only on Higgs decay in the 
four-lepton final state is that we can perform a realis- 
tic study without resorting to full simulation of a de- 
tector. This is demonstrated in Section IV, where we 
parametrize the relevant efficiencies, resolutions and ac- 
ceptances for an LHC detector. Because both the AT- 
LAS and CMS detectors in general measure muons and 
electrons with exquisite precision, the resolution and effi- 
ciency for detecting the four leptons can be significantly 
degraded with no impact on our results. 

This is not to say that detector effects are not impor- 
tant. We will show that the finite phase space acceptance 
of any LHC detector has strong effects on the HLL anal- 
ysis, causing a detector-induced sculpting of the angular 
distributions used for HLL discrimination. We demon- 
strate that these effects must be accounted for in order 
to avoid serious biases in the characterization of a Higgs 
signal. 

Our analysis depends on five distinct angles that de- 
scribe the H — ZZ* — >• 4£ decay process. In the case 
where one of the Z bosons is strongly off-shell, the SM 
Higgs versus HLL decays also differ in their dependence 
on the reconstructed Z* invariant mass. Because we are 
interested in HLL discrimination with small data sam- 
ples, at or near the moment of discovery, we need to use 
all of the decay information in the events, including not 
just the distributions but also the correlations between 
all five (or six) of the relevant observables. 



In the same spirit, we disentangle the Standard Model 
ZZ backgroimd from the putative Higgs signal using 
the gPlots technique [13]. This produces an effectively 
background-subtracted data sample where, instead of 
making stringent requirements that reduce the signal 
yield available for characterization, we reweight the se- 
lected events according to how likely each event is con- 
sidered to be signal by the fit, keeping the normalization 
to the signal yield found in the search. 

Previous analyses of the Higgs golden mode decay 
properties have examined the dependence on some of the 
relevant angular distributions [14]- [19] and have shown 
the potential for LHC measurements to discriminate a 
SM Higgs from look-alikes with different spin and par- 
ity assignments or CP properties [4],[17]-[33]. However, 
none of these studies utilized all of the decay information 
in the events, and all of them have ignored the effects of 
detector phase space sculpting of the angular distribu- 
tions. 

In our analysis we compare a SM Higgs signal to a va- 
riety of Higgs look-alikes. We consider the most general 
Lorentz invariant couplings of a massive, spinless boson 
to ZZ or ZZ*] this corresponds to gauge-invariant cou- 
plings up to dimension six. Some of the corresponding 
HLLs can be considered as modifications of the SM Higgs 
properties via P or CP violation or Higgs compositeness. 
Another spin HLL corresponds to a new massive pseu- 
doscalar, a particle occurring in models with extended 
Higgs sectors such as supcrsymmctry. 

Our HLL analysis also includes the most general cou- 
plings of a massive neutral spin 1 boson to ZZ or ZZ* . 
The off-shell case has not been presented before, to our 
knowledge. A spin 1 HLL is a special case of what is 
usually denoted as a Z' vector boson. The spin 1 part of 
our results is then also part of a Z' look-alike analysis, 
which is interesting in its own right [34]. 

We also discuss as one of our HLLs a massive spin 2 
resonance coupling to the ZZ energy-momentum tensor, 
not necessarily with the universality of a graviton-like 
coupling. Although universally-coupled massive gravi- 
tons are already experimentally excluded in the relevant 
mass range [35], general spin 2 HLLs are a natural exam- 
ple of our study of spin discriminations. 

In Section II we define our notation for the observ- 
ables of the four-lepton final state. Section III contains 
the general gauge and Lorentz invariant couplings of an 
HLL to ZZ or ZZ* , with a discussion of other symmetry 
properties. We describe in Section IV event generation, 
detector simulation, and the construction of effectively 
background-subtracted samples using gPlots; here also 
we show the sculpting of the angular distributions and 
correlations by the finite phase space acceptance of the 
detector. In Section V we describe our statistical ap- 
proach to HLL discrimination using hypothesis testing 
with likelihood ratios. We demonstrate in Section VI the 
consistency of our methods by applying them to the dis- 
crimination of signal from SM ZZ background. In Sec- 
tion VII we detail many examples quantifying our abil- 
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ity to discriminate a SM Higgs from a variety of HLLs, 
showing in each case the expected discrimination signifi- 
cance as a function of the number of signal events; we use 
benchmark Higgs masses of 145, 200, and 350 GeV/c^. 
We summarize, in Section VIII, our results and outlook 
for further improvements. Here we explicitly quantify 
the extent to which our expected discrimination signifi- 
cance would be degraded by using a less complete or less 
rigorous analysis. 



II. THE GOLDEN CHANNEL 

We are interested in the case of a SM Higgs boson, or 
a Higgs look-alike, decaying via ZZ or ZZ* into a four- 
lepton final state. We will denote the putative Higgs 
and its mass by H and mji, regardless of whether it is a 
SM Higgs or a look-alike. This notation is also used to 
describe background events, where the four-lcpton object 
is treated as a Higgs or HLL in the sense that m// stands 
for ma. Since the events are fully reconstructible the 
lab frame kinematics of the candidate H particles are 
known: their transverse momentum y^i pseudorapidity 
77, and azimuthal angle. These three variables define the 
direction and boost from the lab frame to the iJ rest 
frame. All other observables can then be defined with 
respect to the H rest frame, as illustrated in Fig. 1. 

The a azimuthal angle plays no physical role, while 
the "Pt and x] distributions influence the way the detec- 
tor selects events, sculpting the distributions of the final- 
state lepton's directions and energies. Once an event is 
boosted back to the 4 1 rest-system (the rest system of the 
two initial-state fusing partons) , the memory of and 
x] is lost, modulo these phase space acceptance effects. 

In the approximation that the final-state leptons are 
massless, 12 observables are measured per event. Since 
all 12 are well-measured there is no experimental reason 
not to re-express these in terms of whatever combinations 
most naturally capture the underlying physics. Thus we 
choose four observables to be mn and the three produc- 
tion observables just described that define the H rest 
frame. The remaining eight observables are taken to be 
the two reconstructed masses of the Z bosons together 
with six decay angles defined with respect to the H rest 
frame. 

In the H rest frame the reconstructed Z bosons are 
back-to-back. We label these bosons as Z\^ Z^ and take 
the direction of Z-2, as defining the positive 2-axis. Be- 
cause of Bose symmetry, the labeling is arbitrary; in the 
case of an e+e^/i+/i^ final state wc will follow the liter- 
ature [29] and choose Zi to be the Z boson that decayed 
to muons. We then adopt the additional convention that 
the transverse direction of the /x~ lies along the positive 
y-ax;is; thus the Zi decay leptons lie in the y-z plane. 

With the above choices, the reconstructed Z boson 
masses m\ and m2 also define the longitudinal boosts 
from the B. rest frame to the rest frames of the decaying 



Z\ and Zi bosons. The boost parameters are given by 
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We let ^1, kp\ denote the decay angles in the Z\ rest 
frame, while ^2, </'2 denote the decay angles in the Z^ 
rest frame. 

There are two additional angles O, $ defining the di- 
rection of the initial state partons as reconstructed in 
the H rest frame. For a gluon-gluon initial state these 
angles measure a rotation from the 2-axis defined above 
to the direction of the initial state gluon with positive 
^-component of momentum. For quark-antiquark (gg) 
initiated production of an HLL we have the problem 
that we do not know cvcnt-by-cvent which proton con- 
tributed the antiquark; this is resolved by symmetrizing 
the expected angular distributions under the replacement 
cos© —cos©. 

As expected, one combination of the three azimuthal 
angles $, <pi and <p2 is physically redundant. We take 
advantage of this fact to make the replacements 
^ + (j), ip2 ^ ^- Thus (p then represents the azimuthal 
rotation between the Z2 and Zi decay planes. 

In summary, the 4- momenta of the process gg ^ H ^ 
Z1Z2 —7- i^li £2^2 are explicitly parametrized in the H 
rest frame as 
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(72(1 + /32C2), 0, S2,12{P2 + C2)) , (3) 

(72(1 - /32C2), 0, -S2, 72(^2 - C2)) , 

(71(1 -I- /3ici), -s si, -c si, -71 (/3i + ci)) , 

(71(1 - /3ici), s si, c si, -71(^1 - ci)) . 



Here k denotes the 4-momentum of H, while pi, p2 are 
the 4- momenta of Zx, Z2- We used the condensed no- 
tation C, S'=cos0, sin©, c.s—coscj), sin^, ci,Si=cos^i, 
sin 01, and C2,S2=cos02, sin 02- 

Of the five relevant angles, © and $ are Z-paii produc- 
tion angles, while the remaining three are 41 production 
angles. We will use the notation 



n = cos©} , 

cJ = {(j), COS^i, COS02} • 



(4) 



For a SM Higgs, the distributions in and <I> are fiat if 
we ignore the phase space acceptance effects inherent in 
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any experimental analysis. In previous studies these two 
angles have typically been integrated over. 

Although we have tried to conform to the literature in 
our parametrization of the decay angles, we note that the 
literature itself is divided over the choice of which decay 
plane orientation corresponds to 0=0 rather than (p—ir. 
We conform to the convention of Buszello et al. [29], 
which is opposite to that of Djouadi [4] and Bredenstein 
et al. [36]. 




FIG. 1: The Cabibbo-Maksymowicz angles [37] in the H -5> 
ZZ decays. 

The decay amplitudes defined in the next section de- 
pend on two combinations of the boost parameters 71 
and 72, defined by 

la = 7172(1 + /3i/32) , (5) 
lb = 7i72(/3i +1^2) , (6) 

which are in fact just the cosh and sinh of the rapidity 
difference of Z2 and Zi , such that 

7.^-7.^ = 1. (7) 
More explicitly, we have 

la = {ml - {ml + ml)) . (8) 



form factors X to Q are functions of fc^ and pi ■ p2 which, 
with no loss of generality, can be taken to be real (but 
for their absorptive parts, expected to be perturbatively 
small). The rescalings by 1/M| are just for definiteness, 
since the true mass scale of the underlying operators is 
as yet unspecified. In practice we also remove an overall 
factor of igMz/ cos 6]^, so that X=l corresponds to the 
tree level coupling of a SM Higgs boson. 

Similarly, the most general vertex describing the cou- 
pling of a spin J=l particle to two Z-polarizations (in- 
dices jj, and a, momenta pi and P2, respectively) and to 
its own polarization (index p) is: 

iPA.« ^ X (gP^p'^+gP'^pt^) + [P+iQ) e'"'"(pi-p2), (10) 

again with X, P and Q real. 

The most general parity-conserving vertex describing 
the coupling of a J=2+ particle of polarization tensor e'"^ 
to our two vector bosons is: 

^P^Ma ^ m]j g^'P g""" 

+{X,+tY,){p^pPg'^P+p1pPg'^^) 
+{X2+tY2)p1p^,gP'', (11) 

where we have dropped contributions that have more 
than two derivatives or are odd under parity, and again 
with all coefficients real. The special case of tree level 
graviton-like couplings corresponds to 

Xo^-^K, Xl^K, X2 = -K, (12) 

with all other coefficients vanishing and k an overall cou- 
pling strength. 

These general couplings, with naive mass dimensions 
d = 3, 4, and 5, can arise from SU{2)]^ x U{1)y invari- 
ant operators of dimension 5, 6, or higher. Since, for 
HLLs with non-vanishing weak charges, this parentage 
introduces model dependence, we relegate it to a brief 
discussion in Appendix A. 

B. 'Pure' cases of specified J^^ 



III. COUPLINGS AND ANGULAR 
DISTRIBUTIONS 

A. General couplings to ZZ* 

The vertex Feynman rules for the most general cou- 
pling of a spinless particle to the polarization vectors 
and £2 of two Zs of four-momenta pi and p2 are given 
by the expression: 

L^a =Xg^^-{Y + iZ)^ + {P + iQ) e^J^ , (9) 

where we have suppressed repeated indices in the con- 
traction of the four-index e tensor, k=pi -f P2 and only 
Lorentz-invariance has been assumed. The dimensionless 



We specify in this section the results for four cases 
(scalar, pseudoscalar, vector and axial vector) that would 
be 'pure' in the sense of having a single dominant term in 
their HZZ couplings, which we use to define their spin 
and parity. This allows one to illustrate the mass and 
angular dependences of the predictions, setting the stage 
for the later discussion of the impure cases for which P 
and/or CP are not symmetries of the theory, and to es- 
tablish comparisons with the existing literature (but for 
the ZZ* case for J=l, which we have not found else- 
where) . 

The general expressions for the angular correlations in 
the ZZ* case (which includes ZZ when the two Z masses 
are fixed at Mz) are given in Appendices C and D, where 



5 



denotes the quantity arising from the SM couplings of 
the Z bosons to the final state leptons. 



3. A pure vector, J 



PC 



The standard Higgs, J 



PC , 



:0++ 



The tree level SM coupling of the Higgs to two Z's of 
polarisation ei and €2 is oc eyci, see Eq. (9). The angular 
distribution of the leptons in H ^ ZZ 4 1 decay, for 
on or off-shell Z's of mass mi and m2, is: 



rfr[o+] 

dci dc2 di 



1 1 4rii 22i/'2, 2\22 

mi m2 rriH [1 + c^c^ + (76 + c )s\S2 



+ 27aCSiS2 C1C2 + 2Tf-{cxC2 +7aCSiS2)] ■ (14) 



2. A pure pseudoscalar, J^*^ = 

The coupling of a J^'"=0 pseudoscalar to two Z's 
of polarisation ei and €2 and four-momenta pi and P2 is 
proportional to e[ei, £2,^1,^2], see Eq. (9). The angular 
distribution of the leptons in its ZZ — 4 1 decay is: 

dr[o-] 4 4 2 

oc mi m2 7b 



dci dc2 
(1 



2 2 
C1C2 



2 2 2 I o 2 \ 

c S1S2 + 2 ri C1C2) 



(15) 



The coupling of a J^^^l vector particle of polar- 
ization ch to two Z's of polarisation ei and £2 and four- 
momenta pi and p2 is oc en-ei €2-pi + en- £2 ei-P2, see 
Eq. (10). Unlike for the scalar cases, the fully differen- 
tial decay amplitude depends nontrivially on the angles 
Q and i", representing correlations between the helici- 
ties of the initial and final state particles. Assuming a 
quark-antiquark initial state this, in principle, introduces 
two new parameters: the vector and axial couplings of 
the (massless) quarks to the spin 1 HLL. However, once 
we symmetrize over cosO -f-)- —cos 6, reflecting our igno- 
rance of which colliding proton contributes the antiquark 
of the hard scattering, the dependence on these new cou- 
plings disappears except for an overall factor. Performing 
this symmetrization, we also introduce the notation 



rrij = TO, — Too 



(16) 



and find the angular distribution of the leptons in H 
ZZ* A I decay as follows: 



dC dci dc2 d^ dcf) 

,2„2 



cx 4mf m27b S^sfsl {2m^ - m% [ml cos(2($ -|- </>)) -|- m2 cos(2$)] ) 



+m%{l + C^) [2m2Si -I- 2m{s\ — (mf -|- m2)sf s^] -|- 4mijm^ C S [mici S1S2 sin($ + <!>) — 171202 S2S1 sin $] 
-2m%mim2SiS2 ((1 + C^)(ciC2 - ??^)c + S'^{ciC2 + rf) cos(2$ -|- cj))) 

I 



(17) 



4. A pure axial vector, J^^ = l"*""*" 

The coupling of a J^*-^ = 1++ axial vector par- 
ticle of polarization cr to two Z's of polarisation ei 
and €2 and four-momenta pi and p2 is proportional to 
e[ejif , ei, e2,Pi — P2], see Eq. (10). After the same sym- 
metrization in cosO described above, and introducing 



the notation 



Ml = 

f2 _ 



m\ — Sm^ 



ml , 



M2 = mjj — mi — 3m2 , 



(18) 



the angular distribution of the final state leptons is given 
by: 



dCrfc^ic \^d(l} °^ [M^mj cos(2($ + 0)) + M^mj cos(2$)] + Smlmlm'^S'^ [cl + cl + SiS^s^ + 2rfciC2] 

+m]j{l + C^) [2M^mlsl + 2M|m?s^ - {M^ml + MiVii).?^.?^] (19) 
— 8mHm^mim2C S [M|miS2 (c2SiCsin($ -f </)) + ci(ciC2 + ff ) sin$) 

-M1TO2S1 (ciS2Csin$ + C2(ciC2 + if) sin($ + 0))] 
-|-2ml^Mf M|mim2SiS2 [(1 + C2)(ciC2 - r?^)c - S^{ciC2 + r?^) cos(24> + </>)] . 
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5. A pure massive graviton, J^^ = 2+"'" 

Since the general analysis of spin 2 coupling to off-shell 

Z^s is quite cumbersome, wc will only quote results for 
the example of a positive parity spin 2 with graviton-like 



couplings produced by gluon fusion and decaying to two 
on-shell Z's. Defining the on-shell ratio x = itih/Mz and 
using the massive graviton formalism of [38], we obtain 
the tree level angular distribution: 



dr[gg — >• graviton — )• ZZ] 



dC dci dc2 d$ d(j) 



+8x^52 



[2 + + {2 - 3S')4]si cos($ 



[2 + + (2 - 3S')ci]si cos^$ 



+S''sisi,[x^ cos(2$ + ( 
+25^01 C2 si S2 \x^ [2{: 



.)2 + 16 c2] - {si + sl)[{x^ + AfC^ + 2(3x4 - 16)^2 + {x^ - 4f] 
;2 + 4) - (a;2 + 12)52]cos(2$ + (/>)+ 4 [4x^ - {3x^ + 4)S^]c . 

I 



(20) 



Note the cos^O dependence characteristic of a spin 2 
resonance. 



C. Tests of symmetries 



This term is CP odd and T-even, a combination not 
addressed by the theorem quoted above. It is a C-odd 
observable, in that it changes sign under the interchange 
of Pe+ o and ^ p^-, tantamount to cos^j -H- 
—cos 9i in our chosen notation. 



Now we discuss the behaviour of the HZZ cou- 
plings under various symmetries, including CP and Bose- 
Einstein statistics. The discussion attempts to clarify the 
literature on these issues. 

Consider the J=0 case. The most general coupling of 
a spinless particle to the polarization vectors ei and €2 of 
two Z's is that of Eq. (9). In computing the ensuing H — > 
ZZ* — 4£ process one finds that the XP interference 
term is of the form: 



oc 



dr[0,Todd] 
dci dc2 dcj) 

2 m\ ml m% 76 si S2 s [si S2 c 7a (ciC2 + r?^)] , (21) 

where the term sin^i sin ^2 sine/) oc Pe+ ' P^- ^ P/J.+ - By 
definition, this observable is T-odd: it changes sign as 
all three-momentae are reversed (the tilde in "T-odd" 
emphasizes that past and future are not being inter- 
changed) . 

The Born approximation is, by definition, the result 
of squaring the amplitude dictated by the Lagrangian to 
lowest order in its couplings: a quadratic result, in our 
case, in any pair of the quantities X to Q in Eq. (9). To 
this order, a T-odd observable must vanish if CP is a 
symmetry, as shown in [39]. Thus, a non- vanishing T- 
odd observable such as that of Eq. (21) can only arise if 
CP-invariance is violated. 

The XQ interference term resulting from Eq. (9) is: 



(ir[0,Codd] 



oc 



dci dc2 ( 
-2 Tj ml ml m% 7b [c\ 



C2] (1 -I- C1C2 -I- 7a S1S2 c) . (22) 



1. Bose-Emstein statistics 

The general coupling, up to two derivatives, of a J = 1 
particle to two Z's is that of Eq. (10). This is true 
whether or not the Z's are on-shell, which seems to be 
a point of confusion in the literature. Thus for example 
[23], whose authors were the first to emphasize the im- 
portance of Mz* as a discriminating variable, contains 
extra "off-shell" couplings, such as g^"{p-L — P2Y and 
e''^"(pi +P2), that violate Bose symmetry and vanish for 
two on-shell Z's. However, Bose symmetry is a property 
manifest at the Lagrangian level, and thus independent 
of any on- or off-shell considerations. The two Z's in 
an — > ZZ* decay are described by the same bosonic 
Z field, whether or not they are on-shell, and they do 
not obey the laxer rules that different particles (Z ^ Z') 
would. 



D. Tests of compositeness 

If the couplings of an HLL conserve P and CP, but 
the object is not point-like, there will be deviations from 
the standard g^i, coupling to Z's. To lowest order in 
the dimensions of the corresponding effective operators, 
these will be of two types. The first is a non-vanishing 
Y in Eq. (9), and the second is a nontrivial form for X. 
Barring large effects -quite conceivable in a model with 
multiple SM Higgs-like fields- deviations in X are much 
harder to limit or measure than a non-zero Y/ X which is 
governed by the shapes of angular distributions. Contri- 
butions to Y can arise from gauge invariant operators of 



dimension 5 containing a non SM-like spin HLL (Ap- 
pendix A) or from higher dimension operators containing 
the SM Higgs [40] -[42]. 

It is useful to introduce the notation tan^ = Y/X. In 
this notation, the "composite" HLL angular distribution 
is of the form: 

dTc = cos^^dTxx + cos^sm^dTxY+sm^^dTYY , (23) 

where dTxx is the standard result of Eq. (14). The in- 
terference term is: 

dVxY 

oc 

dci dc2 dtp 

-2 ml ml mjj ll Si S2 (ciC2C -|- 7a S1S2 + c) , (24) 
and the last term is: 

d^YY 4 4 4 2 2 /oc\ 

^^^-^ocm,n^,7.^r^2. (25) 

Contrary to all of the other cases we study, the inter- 
ference term in this instance is between two operators 
whose P and C are identical: the HLL is not point-like, 
but it is 'pure' 0^"*". As a consequence, the angular dis- 
tribution of the interference term is not very different 
from that of the XX and YY terms and the interference 
can, for certain values of Y/X, be very destructive. This 
can be seen even at the level of the H — ZZ branching 
fraction, the integral of Eq. (23) over cos 6*1 , cos 6*2, and 
0: 

Fc oc m\m\ [2cos^$ -I- (7aC0S^ - mim27bSin^)^] . (26) 

If ^ has a value close to the (mass-dependent) point of 
maximal interference, the golden mode channel can be 
suppressed by a large factor. For this to happen X and 
Y ought to be of the same order of magnitude, signifying 
a low dynamical scale for a composite Higgs. 

IV. ANALYSIS 

In this section we describe the modeling of the de- 
tector effects and the analysis strategy to extract an ef- 
fectively pure sample of signal events. We describe the 
Monte Carlo (MC) event generation and the simulation 
of the detector response. We use parameterized recon- 
struction resolutions and efficiencies based on the pub- 
lished CMS performance results [43]. A similar study 
can be performed with parameterizations based on the 
ATLAS detector. We focus on the four-muon (4/x) final 
state, but the results can be generalized to include final 
states with electrons. Since a four-lepton final state is 
relatively "clean" in the LHC environment, we apply a 
loose event selection and use a maximum likelihood (ML) 
fit technique to separate the signal from the background. 
This maximizes the statistical power and the possibil- 
ity of characterizing the nature of the discovered particle 
through the study of the multi-dimensional angular dis- 
tribution of the four leptons in the resonance rest frame. 
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FIG. 2: 2D pT-ri pdf oi a 0"*" HLL resonance (left) and a 1~ 
one (right) for ^^=10 TeV collisions, obtained using PYTHIA 
and the CTEQL5 parton density functions and for mH~U5, 
200, 350 GeV/c^ (top, middle and bottom). 



A. Event generation 

The knowledge of the four-momenta of the leptons fully 
specifies the information needed in this analysis. We gen- 
erate the four-momenta of the leptons from the five- or 
six-dimensional probability density functions (pdfs) of 

X = {lj, n} for ZZ , 

X = {lu, n, Mz*} for ZZ* , (27) 

where Q,l3 are given in Eq. (4). The X quantities are 
generated in the rest frame of the decaying resonance. 
The muons are then boosted to the laboratory frame, 
and the detector effects (acceptance, efhciency and reso- 
lution) are applied to the boosted momenta. We use the 
azimuthal symmetry of the LHC detectors to reduce the 
remaining kinematic degrees of freedom to the knowledge 
of the pt, t] and the invariant mass 7714^ of the 4/i system. 
The Pt, f] for the signal is taken from a two-dimensional 
pdf generated using MCONLD [44] . We consider proton- 
proton collisions at ^/s^lO TeV, and we model the par- 
ton distribution functions (PDFs) using CTEQ5L [45]. 

In this analysis we do not assume a specific signal pro- 
duction mechanism and cross section, instead relying on 
the discrimination provided by the angular distributions 
of the leptons in the final state. Figure 2 has the pT 
vs. rj pdfs for a spin-0 and a spin-1 HLL. As discussed 
in Section I, for all the signal generation we use the pT 
vs. J] pdfs of the scalar. For the SM ZZ background the 
Pt, t] and m^^ are taken from a three-dimensional pdf 
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generated using the PYTHIA [46] leading-order MC gener- 
ator. The momenta of the four muons in the rest frame 
of the ZZ^*^ system as a function of are generated 
according to the theoretical distributions. 





-0.5 0.; 
sin(ef"-6f™) 



70 80 90 100 T 

m^^ [GeV/c-] 

FIG. 3: Distribution of the dimuon invariant mass for a sam- 
ple of signal H — >■ ZZ events, generated using our very-fast 
muon simulation. The parameters of the superimposed fit are 
extracted from [47]. 



B. Detector emulation and event selection 

Muon reconstruction efficiency and resolution are pa- 
rameterized as a function of the muon and r\ ac- 
cording to [43] , where the muon reconstruction efficiency 
is close to 100% for muons vifith -pT > 10 GeV/c and 
\ri\ < 2.3, corresponding to the event selection in our 
analysis. The reconstruction efficiency is applied through 
a hit-or-miss technique. For muon candidates accepted 
by the efficiency filter, the reconstructed momentum is 
determined by applying Gaussian smearing functions to 
the true pt, Tj and with p^- and 77-dependent resolu- 
tions. We verified the goodness of our very-fast muon 
simulation by comparing the parameters of the fit of the 
Z invariant mass distribution obtained in our analysis, 
see Fig. 3, with the corresponding ones from a published 
full-simulation analysis [47]. 

A number of detector related effects can modify the 
X observables' pdfs. The resolution of the observables 
used in the analysis is shown in Fig. 4 and is found to be 
small independent of the HLL resonance mass and quan- 
tum numbers. The systematic bias in the reconstruction 
of the same variables is shown in Fig. 5 and is found to 
be negligible. This shows that the sculpting of the ob- 
servables' pdfs is not a result of reconstruction resolution 
or bias. Rather, it depends on the simulated kinemat- 
ics of the HLL resonance, including its mass, and on the 
particular model considered (0+, 0~, etc). Specifically, 
the overall phase space acceptance, implemented in the 
signal selection by means of the p^ and 77 requirements, 
produces the largest effects on the observables. This is 
shown in Fig. 6 for a resonance of mass 145 GeV/c^ gen- 
erated with no explicit angular correlations. Adding the 
angular correlations can enhance or reduce the overall 
selection efficiency depending on the details of the mul- 




-1 -0.5 0.5 1 -1 -0.5 0.5 1 



FIG. 4: Reconstruction resolution for the angular variables of 
X shown here for a resonance with mass 145 GeV/c^. The 
cos 02 and cos&i distributions are very similar in this case. 
Only events surviving the signal selection are included. All 
distributions are normalized to unit integral. 



tidimensional pdf. Our selection is 60% (74%) efficient 
for a 0+ resonance of mass 200 GeV/c^ (350 GeV/c^) 
as shown in Fig. 7. The same figure demonstrates that 
the efficiency has a non-trivial dependence on the na- 
ture of the spin correlations. Specifically, for a 0~ reso- 
nance of 200 GeV/c^ (350 GeV/c^) the efficiency is 60% 
(69%). With an absence of explicit spin correlations the 
efficiency for a 350 GeV/c^ resonance is 71%. 

We find that changes in the X distributions are 
strongly correlated with the kinematics of the off-shell 
e.g. for cos 62 the largest inefficiencies correspond to the 
kinematic configurations where at least one of the muons 
is soft. When the correlations between the variables w 
and appear explicitly in the differential cross-sections, 
as is the case for J—V^, the phase space acceptance ef- 
fects are amplified. The consequences on model discrim- 
ination are discussed in Sec. VII B. 

The shapes of the reconstructed w and distributions 
depend on the phase space acceptance both for electron 
and muon final states {H — )■ ZZ — > 2e2/i or 4e). Fig- 
ure 8 shows the relevant kinematic distributions. All the 
results concerning model discrimination, as a function 
of the number of observed signal events, will be nearly 
identical when the additional final states are included 
(2e2/i, 4e), especially when the off-shell Z mass is not 
used as an observable. This is not necessarily the case for 
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FIG. 5: Distributions showing systematic biases for a subset 
of the reconstructed variables X for a resonance with mass 
145 GeV/c^. Only events that survive the signal selection are 
included. All biases are negligible. 



results concerning the discovery of a resonance in these 
final states with respect to the background-only hypoth- 
esis, since different backgrounds need to be considered 
for electron and muon final states. 
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C. Fit definition and signal extraction 

The H — >■ ZZ signal events can be discriminated from 
SM backgrounds using an extended and unbinned ML 
fit. Since there is no resonant Afi background in the SM, 
the fit can use as a discriminating variable the 4/z mass 
distribution. In the presence of a sizable background due 
to fake Z candidates (such as top decays) the 2^ mass 
distributions can be included in the likelihood. Since 
this is not a conceptually different situation, we ignore 
this possibility and assume for simplicity that the only 
relevant background is given by events with two real Z 
candidates. We write the likelihood function as: 

^-^c^p(-E^.) (28) 




where Nj {j—S, B) represents the yield of each compo- 
nent, m\ is the 4/z candidate mass for the event i, and 



FIG. 6: The variables X used in this analysis for a 145 GeV/c^ 
resonance. The off-shell Mz* is required to lie a window be- 
tween 20 and 50 GeV/c'^. The shaded histograms are the 
ID distributions using a constant matrix element (i.e. no an- 
gular correlations included). The overlaid histograms show 
the same distributions for reconstructed events passing the 
Pt and r) signal selection after the detector parameterization. 
All distributions are normalized to unit integral. 



Ps["^] {Pb[^\) is the signal (background) distribution for 
the variable m. The pdjs for the signal and background 
components are described using the template distribu- 
tions from the simulation, as shown in Fig. 9 for m/f =250 
GeV/c^. This fit configuration is appropriate for the HLL 
characterization. 



D. Background subtraction 

In order to establish if a newly-discovered resonance 
is indeed the Higgs boson or not, a hypothesis test 
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FIG. 7: The analysis efficiency for 0"*", 0~ as a function of the 
resonance mass. The case with no correlations is also shown 
for comparison. 
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FIG. 8: Kinematic distributions for the variables cos Q\ (left) 
and 4> (right) for a 0^ resonance with mass 350 GeV/c^. The 
shaded histograms show the ID projections of the variables 
as described by the analytic pd/s. The overlaid histograms 
(blue, red) show the same ID projections for reconstructed 
events passing the pt and r\ signal selection after the detector 
parameterization for 4/.1 and 4e final states. All distributions 
are normalized to unit integral. 




200 220 240 260 280 300 

m^JGeV/c'j 



ibo 250 300 .150 400 450 500 550 600 
m^„[GeV/c^J 



FIG. 9: Distribution of the invariant mass for a sample of 
signal with mH~250 GeV/c^ (left), and background (right) 
ZZ events. 



is performed (see Sec. VII). In this context, a tool to 
disentangle signal and background events from the se- 
lected dataset is an important prerequisite. We use the 
sWeight [13] technique and reweight the selected dataset 
according to how likely each event is considered to be sig- 
nal by the fit. The sW eight technique is statistically op- 
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FIG. 10: The 4/i invariant mass distribution for a sample of 
Ns^lQ H ^ ZZ events with mii=250 GeV/c^ and iVs = 1000 
ZZ background events. The superimposed curves represent 
the likelihood function returned by an ML fit, with Ns, 
Nb, and 7714^ as free parameters (top). Comparison of the 
signal-only MC distribution of cosffi, with the background- 
subtracted distribution obtained with the sW eight technique 
(middle). Comparison of the background-only MC distribu- 
tion of cos 9i , with the signal-subtracted distribution obtained 
with the sW eight technique (bottom). 



timal when the discriminating variable (7714^ in our case) 
in the fit is uncorrelated with the subsequently used vari- 
ables {X in our case). On the upper plot of Fig. 10, the 
4/x invariant mass distribution is shown for a sample of 
iVs=70 H ^ ZZ events (with mH=250 GeV/c^) on top 
of Nb— WOO continuum ZZ background events, corre- 
sponding to a ~ 5(7 deviation from the background-only 
hypothesis. The superimposed curves represent the like- 
lihood function returned by an ML fit (with Ns, Nb, 
and ?7i4^ as free parameters). The middle plot shows the 
signal ^Weighted cos^i distribution. Similarly, the bot- 
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torn plot shows the background sWeighted cos^i distri- 
bution. The comparison of the two sets of points shows 
how the background (signal) subtraction allows one to 
recover the signal (background) distribution for the con- 
sidered variable in the given sample, the deviation from 
the expected pdfs being due to statistical fluctuations al- 
ready present at the MC level. 



V. STATISTICAL APPROACH 

In this section we discuss the statistical formulation we 
use to address comparisons between different hypotheses 
as well as relevant measurements for the characteriza- 
tion of an HLL resonance. We focus on four statistical 
approaches: 

• (1) Search analysis of a signal in the presence of 
backgrounds. 

• (2a) Comparisons between two "pure" spin-parity 
hypotheses (such as 0+ vs. 1"). 

• (2b) Comparisons between two spin-parity hy- 
potheses, with at least one of the two being an "im- 
pure" admixture of two pure HLL states (e.g. 0+ 
vs. a combination of 1"*" and 1~). This case is sim- 
ilar to (2a), except for the presence of one or more 
nuisance parameters. 

• (3) The measurement of mixing parameters in the 
case of impure Higgs look-alikes. 

In case (1) we consider two hypotheses. Hi is the "stan- 
dard Higgs signal plus background" , and Hq is the null, 
"background only" hypothesis. 

Cases (2) and (3) involve attempting to establish the 
nature of a newly discovered particle. Guided by our 
results on gPlots, we contend that it is a very good ap- 
proximation to confront two different "signal" hypothe- 
ses in the absence of background - the latter having been 
statistically subtracted. This assumes that a resonance 
mass peak has already been established. 

The case (2) hypotheses refer to an mn peak with 
two different interpretations. In the (2a) case the 
two hypotheses under consideration are simple, i.e. the 
corresponding likelihoods are fully specified once the val- 
ues X are fixed. In the (2b) case the unknown mixing 
angles for the impure hypothesis, referred to as ^ (and 
including e.g. various mixing angles ^ and 5 as discussed 
in Sec. VII F), are treated as nuisance parameters. The 
analysis in case (3) is a traditional parameter estimate, 
based on the ML fit, for which we obtain a confidence in- 
terval by using the Feldman-Cousins approach [48]. We 
discuss the three cases starting from the last. 



A. Coupling admixtures 

Consider the example of a one-parameter mixture of 
two types of HZZ coupling, such as the composite case 
discussed in Sec. HID. For a fixed value of the resonance 
mass rriH and the mixing angle ^, Eq. (23) is the theo- 
retical probability-distribution of the events as a function 
of the variables X for ZZ and ZZ* final states. The ex- 
perimental pdf is a numerical representation of the result 
of sieving -through a specific detector and its resolution, 
trigger and analysis requirements- a very large number 
of events, generated with the theoretical pdf oi Eq. (23). 
This experimental pdf, referred to as P, is a function 
P=PmHi^, X) of mn, (which is kept fixed through this 
exercise), ^, and X. The dependence on = {cos 9, $} 
is, in this example, exclusively a phase space acceptance 
effect. 

Many experiments with a fixed number of events Ng 
are simulated, assuming the same detector response. The 
probability of each event, evaluated with the experimen- 
tal pdf, is Pi. The likelihood of a given experiment is 
£(^) = Yli'jiPi- The experimentally measured value of 
the ^ parameter, ^ corresponds to the value that maxi- 
mizes C{^). The simulation is repeated many times, as a 
function of the true value of the mixing angle ^. Running 
many experiments one can derive the confidence interval, 
i.e. the range covering the true value of ^ for some confi- 
dence level and some measured value ^ [48] . 

It is customary to estimate the error (or the number 
n of standard deviations a) in the measured ^ from the 
expression £{£,max ± ncr) = £-{^max) — r? jl. While this 
method is accurate for large samples with Gaussian er- 
rors, it is not the one used to draw the a contours in 
Fig. 11 (where ^—i,xQ as given in Eq. (32) and in the 
similar figures of Sec. VII). Instead, the confidence level 
(CL) is evaluated measuring the frequency of a given re- 
sult in the set of generated pseudo-experiments. 

H = O"'' m^^ = 200 GeV/c" N^" = 50 
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FIG. 11; Confidence intervals for measured values of ^xq for 
a C-violating J=0 resonance witli a mass 200 GeV/c'^. 
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B. Confronting hypotheses 



Consider two hypotheses, EIo,i, for the spin-parity 
assignment of a signal candidate sample, detected via 
its ZZ mass peak and background-subtracted using the 
sPlot method. Large numbers of events are generated 
assuming each hypothesis and used to construct two 
unbinned experimental pdfs: Pmo i = Pmni^ I Ho.i). 
For our pure spin-parity cases, the simple nature of the 
hypotheses considered guarantees through the Neyman- 
Pearson (NePe) lemma [49] that the hypothesis test is 
universally most powerful. Next, we explicitly identify 
one hypothesis as Hq and the other as Hi. Addition- 
ally, we specify the test statistic A which we define as the 
log-likelihood ratio log[£(IHIi)/£(IHIo)]. Finally, we must 
a priori choose the acceptable probability level a of re- 
jecting Hp in favor of Hi, even though Hq is true (Type I 
error) . We generate a series of pseudo-experiments with 
a fixed number of events Ns to construct the pdf of A 
for the two hypotheses. A typical result is illustrated in 
Fig. 12. We first generate pseudo-experiments consider- 
ing Ho as true. For each experiment we construct two 
likelihoods £(Ho) = Yl^JiPsoi^i) for the correct inter- 
pretation of the true theory, and £(Hi) = Hilfi PMii^i) 
for its incorrect interpretation. With the ensemble of 
experiments one constructs the distribution P(A | Ho) 
with A EE log[/:(Hi)//:(Ho)]. The result is the leftmost 
(red) curve in Fig. 12. The exercise is repeated with the 
pseudo-experiments generated considering Hi as true and 
the result is the rightmost (blue) curve in the figure. An 
a priori chosen value of a implicitly defines a value A(a) 



P(A I H^) 




FIG. 12: Distribution of A for mH=200 GeV/c^ and A's=23, 
constructed with ~ lO'"* 
being confronted are EIo=0^ and Eli=0" 



a= / P(A|Ho)dA 

/a(q) 



(29) 



This fixed value A (a) implies that 



P(A|Hi)dA 



(30) 



is the probability of accepting Hq even though Hi is cor- 
rect (Type II error). The value 1 — /3 is called the power oi 
the test. When the real experiment is performed, a spe- 
cific value Aexp, is obtained for A. The associated p-value 
— P{A I Ho) dA , is compared to a to determine if 
the measurement favors one hypothesis versus the other. 

Instead of the a and /? values, the significance a is com- 
monly used. To convert to an equivalent number of ct's 
using Fig. 12 we calculate the same a-area in a Gaussian 
distribution centered at with a=l. The number n of 
a-equivalent standard deviations is obtained by inverting 



1 

/2^ 



dx f 



72 



(31) 



The a priori (subjective) choice of a (and subsequently 
/3 and corresponding significances) is heavily discussed 
in the literature. The Physical Review, for example, 
requires a 5a (3cr) significance to claim discovery (evi- 
dence). The caveat is, of course, that when one minimizes 
as much as possible the probability of an error of Type I 
(wrongly claiming a discovery) one risks making an error 
of Type II (and e.g. delaying the claim of a discovery to 
the next luminosity upgrade). 

A pure vs. impure HLL hypothesis test has an addi- 
tional complication due to the dependence of the likeli- 
hood function on the mixing angles ^ in at least one of 
the two hypotheses. In this case, we are testing the sim- 
ple (i.e. mixing angle independent) hypothesis against a 
class of alternative hypotheses, connected by the vari- 
ation of a continuous unknown parameter (s). The test 
is performed by comparing the simple hypothesis to the 
impure hypothesis with values of ^ that best fit the data. 

The impure vs. impure Higgs look-alike test is tech- 
nically identical to the pure vs. impure. Here, we try 
to exclude some value of the mixing angle parameter for 
one of the two composite hypotheses in favor of the al- 
ternative impure hypothesis, where the mixing angles are 
treated as nuisance parameters. With fixed mixing an- 
gles, one impure look-alike becomes a simple hypothesis 
(like a pure one) tested against an impure hypothesis. 



C. Higgs searches 

When searching for a new particle two hypotheses are 
tested against each other: the background-only. Ho, and 
signal plus background. Hi. 
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Assuming that the event distributions for signal and 
background are fully specified (an unrealistic situation 
in that the value of the Higgs mass and width are not 
known a priori), one still has to determine the signal and 
background yields. Hence, the likelihood function has a 
parametric dependence on at least one nuisance param- 
eter. There is no guarantee that the Neyman-Pearson 
construction is, in this case, the optimal hypothesis test 
one could perform. We arc, however, not concerned with 
what the optimal statistical test is, but rather on the 
physics content of the likelihood function. Our aim is 
to illustrate how different analyses that fully or partially 
exploit the information in the theoretical pdfs compare 
with each other. For this purpose it is sufficient to use a 
consistent statistic among the various cases and discuss 
their relative merits. We still perform a hypothesis test 
based on the likelihood ratio. The dependence on the 
nuisance parameters is removed through a maximization 
(profiling) of £(IHIo,i) relative to the nuisance parame- 
ter(s), prior to the construction of the likelihood ratio, as 
done for the case of impure hypothesis-testing discussed 
in Sec. VB. 

Given a specific analysis setting (i.e. a set of variables 
defining the likelihood function) we evaluate its discov- 
ery power by computing the significance (the number of 
ct's) as a function of the signal yield and for different 
values of the ratio of signal over background yields. We 
define an expected value for the signal to background 
ratio, (Ns/Nb), between the signal events constituting 
the m{ZZ) peak and the integral of the background dis- 
tribution in the same variable in the range 190 GeV/c^ 
to 600 GeV/c^. To address the uncertainties, we com- 
pare the two hypotheses for various pre-selected values 
of {Ns/Nb), in a large range including and bracketing 
the central current expectation. The likelihood for Hq 
is then that of Eq. (29), expressed as a function of the 
angular variables at fixed mjj, as opposed to a function 
of only niH- 

When adding the X variables to the likelihood, one 

should consider the event-by-event dependence of their 
pdf on the value of mn- This can done using a different 
X pdf for each bin of the template functions of Fig. 9. 
This step is straightforward when performing the real 
analysis, but CPU intensive when performing hundreds 
of billions of pseudo-experiments. The resonance mass is 
narrow enough for the peak to be determined, in which 
case the results are very close to the ones obtained with 
the full mass-dependence of the X pdf. For simplicity we 
did not include the finite width of the resonance in the 
likelihood. 

In our search results we compare the significance, 
as given by an mn-based peak search, with the cor- 
responding quantity following from the whole angular- 
distribution analysis. In the case of a discovery test, the 
p-value of any toy experiment is compared to the equiv- 
alent of a > 5 0- significant p-value, in order to establish 
if a discovery could be claimed for that experiment. By 
repeating the exercise many times, we can associate a 



probability to the discovery potential. The 5 a conven- 
tion fixes the value of a for the hypothesis test, as well 
as the value of /3 for a given likelihood function. 

VI. SIGNAL SIGNIFICANCE USING THE 
ANGULAR INFORMATION 

As described in the two previous sections, discrimina- 
tion of Higgs look-alikes first requires an event sample 
following a putative Higgs discovery. As noted already, 
the search analysis could be model-independent, relying 
only on the reconstruction of a resonant excess over non- 
resonant backgrounds. In this case a discovery is com- 
pletely factorizcd from its characterization. 

Despite the natural factorization between discovery, 
HLL discrimination based on production, and HLL dis- 
crimination based on decay, it is important to check the 
consistency of the entire chain of analysis. This is espe- 
cially true for the small datasets considered here, where 
we demonstrate HLL discrimination with datasets com- 
parable to the original discovery sample. 

A powerful check is to compare the signal significance 
of two nominal analyses: 

• An "m(ZZ) only" fit, for which the discrimination 
between signal and background is given only by the 

ZZ invariant-mass peak. This is an example of 
a model-independent discovery analysis (although 
not necessarily the actual discovery analysis used 
in the experiment). 

• An "m{ZZ)+X'' fit, in which the pdf for the an- 
gular variables X is also included. Thus here we 
are using the angular information to improve the 
discrimination of the signal from the background, 
rather than discriminate SM Higgs from HLLs. The 
pdf of X corresponds to the value of mjj as ex- 
tracted from the fit. 

We compare the signal significance of the two analyses, 
corresponding to different physics content for the likeli- 
hood function. A common statistical framework is used, 
since we arc interested to compare the physics perfor- 
mance rather than determining the optimal statistical 
approach. The overall normalization is obtained by as- 
suming Y^=10 TcV with a corresponding SM Higgs pro- 
duction cross section [43]. 

A direct comparison of the two analyses in a common 
framework is a way to quantify the price to pay in order 
to run a completely model-independent search. At the 
same time, it is a consistency check on the HLL discrim- 
ination analysis, since the background events are them- 
selves Higgs imposters. If, as we claim, HLL discrimi- 
nation is possible with datasets not much larger than, or 
identical to, the original discovery sample, then we should 
also find that the "m{ZZ)+X" fit offers comparable im- 
provements in signal significance over the ^^m{ZZ) only" 
fit, for similarly small datasets. 
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To make the likelihood comparison meaningful, a com- 
mon fit setting is used. For the ZZ invariant mass, we 
consider the range 190 < m^f < 600 GeV/c^. The fit 
configuration is specified by the nominal expected signal- 
over-background yield ratio (-/V5 /A^s) and by the nominal 
number of signal events Ng- We consider different sce- 
narios by fixing different values of (Ns/Nb) and perform 
the study as a function of Ns- 

For each fit configuration we run a set of toy Monte 
Carlo experiments. The actual number of background 
events are generated according to a Poisson distribution 
around the nominal value, and the event-by-event val- 
ues of the variables used in the fit (to// and, if used, 
X) are randomly generated according to the signal and 
background pdfs. The fit is then performed for each toy 
sample, maximizing the likelihood as a function of the 
signal and background yields and the value of to//. The 
sets of fits provide a distribution for the statistical sig- 
nificance obtained in a particular experiment. 

This is summarized in Figures 13-15. The two bands in 
the figures correspond to the spread (at 68% confidence 
level) for the signal significance achieved in a single ex- 
periment, as a function of the signal yield Ns, for the 
"to(ZZ) only" fit (light band) and the ''m{ZZ)+X" fit 
(dark band). The horizontal lines show the 3a and 5a 
thresholds {evidence and discovery, in the usual conven- 
tion). The intersection with each band provides a cor- 
responding range for the needed signal yield, the spread 
being due to statistical fiuctuations. For a correct inter- 
pretation of the separation between the two bands, one 
should consider that the statistical fluctuations in the two 
fits are strongly correlated since they both depend on the 
invariant mass observable, and background fluctuations 
for this mass distribution will be the same for both. 

Figure 13 has the case of an to//=200 GeV/c^ SM 
Higgs boson, while Fig. 14 illustrates similar results for 
an TO//=350 GeV/c^ SM Higgs boson. For each mass, dif- 
ferent values for (Ns/Nb) are considered; we show here 
the results for {Ns/Nb)^1/5, I/IO for toh==200 GeV/c^ 
and {Ns/Nb)=1/10, 1/20 for to//=350 GeV/c^. We note 
that better discrimination between the signal and back- 
ground in the higher mass case (compared to the lower 
mass) especially in the invariant mass observable; despite 
the lower cross section this results in higher signiflcance 
for the higher mass case for the same luminosity. 

Similarly, Fig. 15 has the results for m,//=200 GeV/c^ 
and TOj/=350 GeV/c^ pseudoscalar HLLs. Here the input 
parameters (such as the cross section) are assumed to be 
those of a SM Higgs boson; only the shape of the pdfs 
defining the likelihood (and in particular the correlations 
between the angles) are different from the SM case. The 
angular distributions and correlations for a pseudoscalar 
resonance are similar to those of the ZZ background, 
resulting in a much smaller improvement in the signal 
significance over the ^^m{ZZ) only" fit, and thus a smaller 
distance between the two bands in the plots. 
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FIG. 13: Distribution of signal significance for a 200 GeV/c^ 
SM Higgs boson decaying in the H — >■ ZZ — > 4/t channel for 
pp collisions with -^5=10 TeV. The mean signal to background 
ratios used are {Ns/Nb)^ 1/5 (left) and 1/10 (right). 
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FIG. 14: Distribution of signal significance for a 350 GeV/c^ 
SM Higgs boson decaying in the H — >■ ZZ — > 4fi channel for 
pp collisions with y^=10 TeV. The mean signal to background 
ratios used are {Ns/Nb}^ 1/10 (left) and 1/20 (right). 
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FIG. 15: Distribution of signal significance for a 200 (left) and 
350 (right) GeV/c^ pseudoscalar particle in the 0~ — >■ ZZ — > 
4/i channel for pp collisions with -^5=10 TeV. The mean sig- 
nal to background ratios used are {Ns /Nb}~ 1/5 (left) and 
1/10 (right), and assumes an NLO SM Higgs production cross 
section for the pseudoscalar. 
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VII. RESULTS 

We present results for three HLL masses: m/f=145, 
200, and 350 GeV/c^, using pseudo-experiments built 
with the full X pdf. 



A. 0+ vs. 0~ 

We consider here two different "pure" scalar hypothe- 
ses: 0+, corresponding to a SM Higgs, and 0~, a pseudo- 
scalar. Neither of these possibilities has an explicit 
dependence on the angles in their differential cross- 
section, meaning that only the variables w (and the off- 
shell Z mass, ■m2=Mz', for niH < 2Mz) are used to 
discriminate between the two hypotheses. 

In Fig. 16 we show the distributions in (j> and cos 6*1 
at m//=350 GeV/c^ for J^=0+ and 0". These angular 
variables (along with cos 02, whose distribution is identi- 
cal to that of cos 01 except when Z2 is off-shell) provide 
the discrimination between these two hypotheses at all 
masses mjj. For masses rriH below the 2Mz threshold, 
the kinematic factors in Eqs. (14), (15) result in the differ- 
ential cross section dependences on the off-shell Z mass 
Mz* that differ for the 0+ and 0~ cases. This is illustated 
in Fig. 17 (left) for r7ii/=145 GeV/c^. For all the discrim- 
inating variables we consider, the ability to distinguish 
between two hypotheses is degraded when their corre- 
lations are neglected. This is shown in Fig. 17 (right) 
where we present the results of the NePe hypothesis test 
between 0+ and 0~ for likelihoods built using different 
subsets of variables and correlations thereof. Specifically 
P{Mz',uj) denotes the use of the full set of variables 
while in P(aJ) the probability distribution of Mz* is ig- 
nored. The product of all one-dimensional probabilities, 
ignoring correlations, is Hi Pi^i)- As expected, the like- 
lihood including all discriminating variables and their 
correlations is optimal. The other two definitions give 
similar results. We note that, regardless of the results, 
the use of Yii Pi^i) is an improper approximation, since 
the Xi variables are far from being uncorrclated. 

The significance for discriminating between the 0+ and 
0^ hypotheses (assuming one or the other to be correct), 
as a function of Ng, where Ng is the number of observed 
signal events, is shown in Fig. 18 for mjy=145, 200, and 
350 GeV/c^. In all cases, results correspond to the case 
where Hi is the true hypothesis (see Sec. V). The model 
discrimination is based on a NePe test between these sim- 
ple hypotheses with test statistic log(£[0+]/£[0~]). The 
variables uj (and Mz' , when applicable), along with their 
correlations, are used in the likelihood construction. The 
significance for rejecting one hypothesis in favor of the 
other at the time of 5 cr excess (see Sec. VI) is better 
than 3cr for m//=145, 200, and 350 GeV/c^ while a 5ct 
discrimination can be achieved with twice the observed 
signal events (less than ~40 events in both mass cases 
presented here). 




FIG. 16: Distributions of the variables (left) and cos Si 
(right) for 0^ and 0^ resonances with m£r=350 GeV/c^. All 
distributions are normalized to a unit integral. 
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FIG. 17; Left: Distribution of Mz- for 0+ and Q- H ZZ* 
decays at mH = 145 GeV/c'^, normalized to a unit integral. 
Right: Median significance for rejecting 0~ in favor of 0^, 
assumed to be correct, as a function of Ns- The different 
likelihood constructions are specified in the text. 



B. 0+ vs. 1~ and 1 + 

We consider here two different "pure" J=l models 
specified by their HZZ couplings: "vector" (J=l^) and 
"axial vector" ( J=l+). Unlike in the 0+ case, the differ- 
ential cross sections have non-trivial dependences on the 
Z-production angles f1 that provide additional discrim- 
ination between 0+ and J—\. In Fig. 19 we show the 
distributions for some of these variables. 

We note that when a J—\ resonance decays in ZZ* , the 
distributions in c\ = cos 0\ and ci = cos 02 are not any 
longer qualitatively similar, as illustrated in Fig. 20 (in 
striking contrast to the J=0 cases) . Figure 20 also shows 
the very strong correlation between Mz- and cos 62 ■ In 
the J=l~ case, this asymmetric effect arises from the 
configurations in which the object, in its rest system, is 
polarized along the direction of motion of one of its Z- 
decay products. These helicity configurations result in an 
addend proportional to m\s\c\+m\s^c\ in the pdf, which 
can be rewritten as 2Af|(sf + s^ — s^Sj) — m?^s\{2 — s|). 
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FIG. 18: Significance for rejecting in favor of 0"*", assuming 
0^ is true (left), and vice- versa, 0^ -f-*- 0~ (right), for m£f =145, 
200 and 350 GeV/c^ (top, middle and bottom). The dashed 
central line is the median significance. The 1 and 2 a bands 
correspond to 68% and 95% confidence intervals, centered on 
the median. 



with = — m\. The second term is l-f->-2 asymmet- 
ric at fixed vrid and induces the difference between the ci 
and C2 one-dimensional distributions. In the J=l^ case 
the asymmetric pdf term is, in the notation of Appendix 
D, 2M^mlsj+2M^mlsl - {M^m'l + MfmDsjsl, and its 
origin is similar. These asymmetric effects significantly 
enable the discrimination between J=l and J=0 models 
when iriH < 2Mz- 

In Fig. 21 we compare the discrimination between the 
0"*" and 1"*" hypotheses for likelihood definitions that ex- 
ploit different variables. The obvious qualitative conclu- 
sion is that likelihoods defined in terms of pdfs containing 
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FIG. 19: Distributions of the variables cos0 (upper left), $ 
(upper right), 4> (lower left) and Mz» (lower right, mH = 14:5 
GeV/c^) for 0+, 1" and 1+ resonances. All distributions are 
normalized to a unit integral. The angular disctributions are 
shown for mfr=350 GeV/c^. 



the most information are the most performant. The fig- 
ure shows the relative discriminating power of the differ- 
ent choices: P(ai , • • • , oat) denotes N-dimensional pdfs in 
the correlated variables {ai, • • • , a^}- Yii is con- 

structed from one-dimensional pdfs for all variables, ig- 
noring (erroneously) their correlations. P(uj\{^l)T^Yi) are 
pdfs including the variables w and their correlations, but 
with the hypothesis 1"^ represented by a pdf'va which de- 
pendence on the variables r2={<i>, cos 0} has been inte- 
grated out of the analytic differential cross-section. The 
likelihood P(aJ |(il)TH) performs badly relative to P(w), 
where the two differ only in that the first construction im- 
plicitly assumes a uniform 47r coverage of the observed 
leptons, as if the muon pT and 77 analysis requirements 
did not depend on the Vt angular variables. The primary 
reason for this difference is the strong correlation between 
the variables $ and in the J=l pdfs, which causes phase 
space acceptance sculpting of the distribution to alter 
the (j) distribution, as discussed in Sec. IV. 

The significance for discriminating between the 0^ and 
1~ (1^) hypotheses, as a function of Ns, is summarized 
in Fig. 22 (Fig. 23). The full correlated set of variables 
VL, w, and Mz* (when applicable) is used in the like- 
lihood construction. The discriminations are based on 
the NePe tests between simple hypotheses with statis- 
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FIG. 20: Distributions in cos 61 (left) and cos 6^2 (right), for 
0^, 1~ and 1"*" resonances with mass 145 GeV/c^, normalized 
to a unit integral. The very strong correlation between Mz* 
and cos 02 is also shown (bottom) for the J—1 case. 



as a function of Ns, is summarized in Fig. 27 for m/f=200 
and 350 GeV/c^. For these tests the variables f1 and cJ 
and their correlations were used in the likelihood. Model 
discrimination is based on the NePe test between sim- 
ple hypotheses with test statistic log(£[0+]/£[2+]) and 
log{C[0+]/C[2-]). 



tic log(£[0+]/£[l-]) (log(£[0+]/£[l+])). The discrimi- 
nation between 0"*" and 1~ or 1"*" is similar. 

C. 0+ vs. 2+ 

We consider one "pure" spin 2 model: a J— 2^ heavy 
graviton-like resonance. A J=2 object has pdfs with non- 
trivial dependence on the angles up to quartic order 
in COS0. In Fig. 24 we show the corresponding distribu- 
tions in the Jl variables for m/f=200 and 350 GeV/c^. 
The ability to discriminate between the 0+ and J— 2 hy- 
potheses improves with increasing resonance mass. De- 
spite the presence of quartic terms in cos 8 in the 2+ pdf 
and the absence of this variable in the 0+ pdf, their cor- 
responding one-dimensional pdfs are similar for the 0^ 
and 2+ resonances for values of mn close to 2Mz, as 
shown in Fig. 24. Similar behavior is observed in the dis- 
tributions of cos 6*1 and cos 6*2, as illustrated in Fig. 25. 
Nevertheless, the inclusion of all angular variables and 
their correlations improves the discrimination power be- 
tween these hypotheses as shown in Fig. 26. 

The significance for discriminating between 0^ and 2+ 



D. Other pure J comparisons 

If a resonance discovered in the 4€ final state does not 
have the quantum numbers of the SM Higgs boson, it 
is likely that 0+ will be rejected in favor of other pure- 
J-PC" hypotheses. The issue of abandoning a particular 
J^*^ in favor of others becomes a combinatoric exercise, 
where the compatibility of the data is assessed against 
each possible pair of hypotheses in a simple NePe test, 
in view of selecting the optimal assumption. In this sec- 
tion we present the expected results for these comparison 
tests, as a function of the observed number of events Ng. 
Following the results of the previous section, we always 
use the full set of angular variables plus, when appro- 
priate, Mz", corresponding to the optimal statistic for 
model discrimination. 

The discrimination between the 0^ hypothesis and the 
pure J=l ones is very similar to the case of distinguish- 
ing the latter from O"*", described in Sec. VII B. The pdf 
for 0^ has also no explicit dependence on the angles VL. 
Differences in the pdfs of these variables provide discrimi- 
nation between 0~ and J=l states, as Fig. 28 illustrates. 
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FIG. 22: Significance for rejecting 1 in favor of 0"*", assuming 
0^ is true (left), or vice-versa (0^-f-)-l~, riglit), for m£f=145, 
200 and 350 GeV/c^ (top, middle and bottom). 



FIG. 23: Significance for rejecting 1"*" in favor of 0"*", assuming 
0^ is true (left), or vice-versa (O""" -f-^- 1^, right), for m£f = 145, 
200 and 350 GeV/c^ (top, middle and bottom). 



The one-dimensional AIz' pdfs are similar for 0^ and 
0"*", as well as for 1^ and 1+, while the differences be- 
tween the two J-values are maximal. The cos0i.2 dis- 
tributions for J=l have qualitatively different behavior 
when niH < 2Mz, as discussed in Sec. VII B. This re- 
sults in the J—1 cos 6*1 (cos 6*2) distribution being more 
"0"-likc" ("0+-like"), resuhing in similar levels of dis- 
crimination between J—0 and J—1. 

The expected significance to distinguish the 0^ and 1^ 
(1+) hypotheses, as functions of Ns, is shown in Fig. 29 
(30). The mH=U5 GeV/c^ results and the ones for 0+ 
vs. J=l (Figs. 22 and 23) are nearly identical. A similar 
comparison of 0^ vs. J— I (Figs. 29 and 30) with O"*" vs. 
J=l (Figs. 22 and 23) for toj^=200 GeV/c^ reveals that 
it is more difficult to discriminate between 0"*" and J—1 



at this mass. This is predominantly due to the pdfs for 
the angles cos0i^2 (which are similar for 0+ and J=l for 
■niH > 2Mz). 

The distributions for the variables D, and uj for all 
the pure J^^ hypotheses considered in our analysis are 
shown in Fig. 31, for m/f=200 GeV/c^. The n distri- 
butions are nearly identical for the two J=0 cases, since 
they are only induced by detector limitations. 

The potential to distinguish between 0^ and 2+ reso- 
nances is shown in Fig. 32 for rriH— 200 and 350 GeV/c^. 
If both of the J=0 cases are excluded in favor of J=l 
or J=2, one needs to discriminate between the latter. 
Relative to J=0 case, the two pure J=l resonances have 
the most similar pdfs, as we saw in Sec. VII B while com- 
paring them to the 0+ case. The comparison to the J=2 
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FIG. 24: Distributions of the variables cos (left) and $ 
(right) for 0^, 2+ resonances with masses of 200 and 350 
GeV/c^ (top, bottom). All distributions are normalized to a 
unit integral. 



FIG. 25: Distributions of the variables cos 9i (left) and (j> 
(right) for 0^, 2+ resonances with masses of 200 and 350 
GeV/c^ (top, bottom). All distributions are normalized a 
unit integral. 



case reflects the same limitation, as shown in Fig. 34 (35) 
for 1- vs. 2+ (1+ vs. 2+). 

The hardest differentiation is between 1~ and 1+. Fig- 
ures 28, 31, and 33, show that the one-dimensional cos 9, 
cos 01, cos 6*2, and Mz* pdfs are similar. While the $ and 
(j) pdfs provide some discrimination, the phase space ac- 
ceptance tends to sculpt the $ distributions (and (f> dis- 
tribution through correlations) in ways that render the 
two cases very similar. The significance for distinguishing 
between the two J=l cases is shown in Fig. 36. We con- 
clude that the discriminating potential is weakest for 1"*" 
vs. 1~, for all niH- We revisit this result in Sec. VUG in 
the context of measuring mixing parameters in a general 
J=l Lagrangian. 



E. 0+ vs. mixed scalar states 

Consider the vertex Feynman rules of Eq. (9) for 
the most general Lorentz-covariant coupling C^a of a 
spinless object to a Z pair. Rather than studying the 
general case, for which any of the quantities X to Q can 
be nonzero, we investigate three cases, each with only 
two non-vanishing types of coupling, resulting in one 
free mixing "angle" and an overall normalization (which 
we ignore): 
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FIG. 26: Median significance for rejecting 2 in favor of , 
assuming 0^ is true, for the different likelihood constructions 
discussed in the text. 
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FIG. 27: Significance for rejecting 2^ in favor of 0"*", assuming 
0^ is true (left) or vice-versa (0^<~>2+, right), for mH~200 
and 350 GeV/c^ (top, bottom). 



FIG. 28: Distributions, normalized to a unit integral, of the 
variables cos^i (top left), cos 02 (top right), (j> (bottom left) 
and Mz* (bottom right) for 0"'', 0~, 1^ and 1~ resonances 
with m£r=145 GeV/c^. 



• X 0, P 0: A scalar whose ZZ coupling vio- 
lates CP, described in terms of an angle £^xp as: 

C^a oc cos(^xp) g^a + sin(^xp) e^_,aPiP2/Ml 

• X 7^ 0, Q 7^ 0: A scalar whose ZZ coupling vio- 
lates C, described in terms of an angle as: 

C^a oc COs(^xq) g^ia + isin(^XQ) £f^aPlP2/Mz 

• X 0, Y 0: A composite 0+, parameterized in 
terms of an angle as: 

oc cos(^xy) ffpa - sin(^xy) /cafc^/Ml 

As a function of Ns we estimate the significance with 
which one can determine: 

• (a) What range of values of the angles can be ex- 
cluded in favor of a pure 0+ for a SM-like resonance; 

• (b) Whether a pure 0+ can be excluded in favor 
of a non-trivial mixture when the resonance corre- 
sponds to one of the three mixed cases discussed 
above. 



We consider first the example of a CP- violating HZZ 
coupling with m^f=350 GeV/c^. 

To address (a) we construct a series of simple hypothe- 
sis tests of the type we considered earlier for distinguish- 
ing between pure J^'^ states. Specifically, for a given 
number of observed signal events at a fixed value of mn, 
we perform a NePe test between two simple hypotheses: 
that the resonance is 0+ (denoted hypothesis Hi ) or that 
the resonance is J=0 with ^xp fixed to a specific nonzero 
value (denoted hypothesis Hq). The test statistic we use 
is \og[C''P{^xp)/C{0+)], where £(0+) and C'Pi^xp) 
denote the likelihoods for a set of events agreeing with 
the hypotheses Hi and Hq, respectively. The test can- 
not be performed for £,xp=0, since in this case the Hq 
CP-violating hypothesis we want to test reduces to the 
alternative Hi hypothesis (the CP-conserving SM Higgs) . 

The result of this test is the significance with which 
hypothesis Hq can be rejected in favor of the hypothesis 
Hi, or similarly, the significance with which a particular 
value of £^xp can be excluded in favor of the 0"*" hypothe- 
sis. This test is then repeated with different fixed values 
of (,xp, i-e. different NePe tests with different hypothe- 
ses Hq. The results for a large ensemble of such tests are 
shown in Fig. 37. Here, Hq = 0"^^ denotes the simple 
J=0 CP- violating hypothesis with (,xp fixed at values 
chosen on the x-axis. 
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FIG. 29: Significance for rejecting in favor of 1 , assuming 
1~ is true (left) or vice- versa (0~ -f-)- 1~, right) for m£r=145, 
200 and 350 GeV/c^ (top, middle and bottom). 



FIG. 30: Significance for rejecting in favor of 1"*", assuming 
1^ is true (left) or vice-versa (0~ ^ l""", right) for m£r = 145, 
200 and 350 GeV/c^ (top, middle and bottom). 



In this example we see that, for A^s=50, the signifi- 
cance for excluding a CP- violating coupling exceeds 3 a 
for ICxpI > 0.5 and 5cr for I-^a'pI > 0.9. 

In addressing (b) we cannot construct a simple NePe 
test between O"*" and a fixed-fxp hypothesis. Instead, we 
treat ^xp as a nuisance parameter and choose a value, 
^jfp, that maximizes the CP- violating likelihood for the 
given set of observed events. Specifically, we fix i^xp at 
a particular value (the "true" value) to generate events 
and perform NePe tests comparing ^xp=0 (denoted hy- 
pothesis Ho) and ^xp = ^xp (EIi). This test is repeated 
for many different values of the fixed "input" i,xp- 

An example of results from an ensemble of these tests 
is shown in Fig. 38. Because of the addition of a nui- 
sance parameter, the figure's interpretation is not simply 
related to the interpretation of Fig. 37, which answered 
question (a). What Fig. 38 shows is the expected sig- 
nificance with which one can exclude the SM hypothesis 
in favor of the CP- violating hypothesis with ^xp=KxPi 



as a function of the true value of ^xp (given on the x- 
axis). No a priori knowledge of the actual value of ^xp 
is required to perform this test. From Figs. 37 and 38 
we observe that the expected significances are symmetric 
around .^xp^O. This is due to the prf/s of the "pure 0+" 
and "pure 0~" terms being even under ^xp —£,xp, 
while the T-odd interference term vanishes under the in- 
tegration of cos 01, cos 6*2 or (p. We shall see that there 
are exceptions to this trivial statement. Comparing these 
two figures we observe a remarkable similarity of the sig- 
nificances of the two tests. Since two different statistics 
are used, this is somewhat of a coincidence. To explain it, 
consider the example with ^xp=7r/5, which corresponds 
to vertical sHces of Figs. 37 and 38. We denote the two 
different test statistics Af^''=log[£^^(^xp)/'C(0+)], with 
^xp fixed at its true value, corresponding to a simple hy- 
pothesis test and A"^''=log[max/:^-P(^xp)//:(0+)], pro- 
filed to the value ^xp at which it peaks. The distributions 
of A'^'' and A"^'^^ are shown in Fig. 39. 



22 



m„ = 200 GeV/c- 



— 0* — r 2* 

— 0- -1* 



m„ = 200 GeV/c- 



— 0* — r 

—0- - r 




© 0.04 




■0.5 0.5 1 

COS 



1 2 3 4 5 6 



m„ = 200 GeV/c- 



-0* — r 2* 
-0- -1* 



m„ = 200 GeV/c- 



-0- - r 



CD 



0.06 
0.055 

0.05 
g 0.045 

0.04 
0.035 

0.03 





-1 



-0.5 0.5 1 1 2 3 4 5 6 

COS 9. 



FIG. 31: Distributions of cos 6 (top left), $ (top right), 
cos 9i (bottom left) and 4> (bottom right) for all the pure J^'^ 
choices we study, for m_f/=200 GeV/c^ . All distributions are 
normalized to a unit integral. 



In the top figure the bell-shaped curves P(A^'^|0+) 
and P(A*^''|0"^'^) are characteristic of a simple hypothesis 
test. The distributions of A™'^'^ have a sharp cut-off at 
^max_Q^ since the O"*" model is a member of the O'^'^ fam- 
ily with S,xp=0, and ma,xL^^{ixp)/L{0'^) > 1, which 
are also features characteristic of this type of test. 

The reason for two very different hypothesis tests to 
end up in the similar-looking results of Figs. 37 and 38 is 
that the statistically-significant features of the different- 
looking P(A) distributions shown in Fig. 39 are actually 
very similar. PiA^^'lO^P) and P(A'"^''|0'^-P) differ, but 
the distributions of p close to the maxima are localized 
around the true input value, their median values and 
68% and 95% confidence intervals are nearly identical 
(try to tell apart the two vertical dotted lines in the lower 
half of Fig. 39, at A ~ 7). Also, the tails of one-minus- 
cumulative distributions for P(A^''|0+) and P(A'°^^|0+) 
coalesce for p- values exceeding 2 a significance, despite 
large differences in the distributions themselves. 

In Fig. 40 we show the results for the distinction be- 
tween pure 0+ and CP-violating J—0 hypotheses for 
mi/=145 and 200 GeV/c^. For 771^=145 GeV/c^, the 
"flat" behavior around £,xp—0 is due to the coupling 
strength of the 0"*" part relative to 0~, an order of mag- 
nitude larger for mjj= 145 GeV/c^ and closer to unity 
for the higher uih values. The corresponding results at 
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FIG. 32: Significance for rejecting 0~ in favor of 2"*", assuming 
0~ is true (left) or vice-versa (0~ -O- 2"'", right) for mH~200 
and 350 GeV/c^ (top, bottom). 
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FIG. 33: Distributions of cos 8 (top left), $ (top right), 
cos 9i (bottom left) and if) (bottom right) for all the pure J^'^ 
choices we study, for mij=350 GeV/c^. All distributions are 
normalized to a unit integral. 
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FIG. 34: Significance for rejecting 1~ in favor of 2"'', assuming 
2+ is true (left) or vice-versa (1~ •f-^2''', right) for m£f=200 
and 350 GeV/c^ (top, bottom). 
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FIG. 35: Significance for rejecting 1+ in favor of 2"*", assumed 
to be correct (left) or vice-versa (l^-f->2^, right) for mH=200 
and 350 GeSf j(? (top, bottom). 
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FIG. 36: Significance for rejecting 1 in favor of 1"*", assuming 
1^ is true (left) or vice-versa (1~ •<->■ l""", right) for m£r = 145, 
200 and 350 GeV/c^ (top, middle and bottom). 



m_f/=350 GeV/c^ are those of Figs. 37 and 38. 

The next mixed J=0 case that we consider is that of a 
C- violating scalar, with mixing angle (,xQ- This scenario 
is very similar to that of the CP-violating scalar: only 
the interference term between the 0"*" and 0^ amplitudes 
is different (C-odd, instead of T-odd). 

The expected results of hypothesis tests distinguishing 
between a C-violating scalar and a 0"*" state are shown 
in Fig. 41. Comparing this figure with Figs. 37, 38 and 
40, we observe identical behavior in all the results. This 
shows that the relative strength between the 0+ and 0~ 
parts of the matrix element squared, rather than the na- 
ture of the interference term, is the most relevant factor 
in resolving the values of ^jfp and ^xq- 

If a pure 0+ hypothesis is rejected in favor of both 
Cjstp 7^ and fxQ 7^ 0, the next question would be 
whether it is possible to distinguish between these two 
cases. To address this question, we perform a series of 
hypothesis tests similar to the one described to answer 
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FIG. 37: Significance for excluding values of ^xp in the CP- 
violating J—0 hypothesis in favor of the 0^ one, assumed to 
be correct, for mH=350 GeV/c^ and Ns=50. The dashed 
line corresponds to the median of the significance. The 1 and 
2 a bands correspond to 68% and 95% confidence intervals 
centered on the median value. 
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FIG. 38; The significance for excluding a pure 0^ in favor of a 
CP- violating HZZ coupling {(,xp 7^ 0), assuming the latter 
to be correct, with ^xp given by its x-axis values. Example 
for Ns=50, mfr =350 GeV/c^. Dashed line and bands as in 
Fig. 37. 



FIG. 39: Distributions of the two statistics A, defined in the 
text, for mjy=350 GeV/c^ and 7Vg=50. The hypotheses are 
Ho=0+, and Hi^Q-^^ with the CP-phase ^xp fixed at n/5. 
(Top) Probability distributions P(A|e). (Bottom) The same 
with the 0^ results traded for 1 minus their cumulative values. 
The two nearly indistinguishable vertical dotted lines corre- 
spond to the median values of the P(A|EIi) distributions. 



type (b) questions. Specifically, we first assume a given 
CP- violating 7^ as "true" . We then assess the ex- 
pected significance with which particular values of S.xQ 
can be excluded in favor of the true hypothesis. Hence, 
for each fixed value of ^xp we perform a test against 
the C-violating case using a fixed £,xq- The test statis- 
tic is A = \og[max L^^{ixp)/L{^XQ)], where the O'^'^ 
hypothesis is simple (fixed £,xq) and L{£_xp) is profiled 
"experiment-by-experiment" . The test is repeated over 
a matrix of values for ^xp and S,xQ- Next, we switch 
the roles of the hypotheses to assess the significance for 
excluding given values of ^xp in favor of ^xQ 7^ 0. The 
results are shown in Fig. 42. The color-coded z-"axis" is 
the median of the significance for ruling out the hypoth- 
esis Hq with the value of given on the y-axis in favor 
of the Hi hypothesis with 7^ 0, assumed to be correct 
for ^Hj^ -values chosen on the x-axis. 

The similarities between the C- and CP- mixed scalars 
are reflected in the y-^r^x symmetries of Figs. 42. More- 
over, switching the roles of the two hypotheses (compar- 
ing the figures on the left with those on the right) one 
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FIG. 40: Left: Significance for the exclusion of values of a 
CP-violating £^xp 7^ in favor of 0"^ {^xp~0), assumed to be 
correct. Right: Significance for excluding a pure 0^ in favor 
of ^xp 7^ 0, assumed correct with ^xp given by its a::-axis 
values. Results for mfr = 145, 200 GeV/c^ (top, bottom) and 
iVs=50. 



only sees small changes. Still, the fact that the diago- 
nals (ICxpI = I^xqI) are not all at the same significance 
shows that the tests are sensitive to the differences be- 
tween the T- and C-odd interference terms, but it would 
require an order of magnitude larger Ns to draw 5 cr-level 
conclusions over most of the {^xPtS,xq) plane. For ex- 
ample, we show in Fig. 43 the significance with which 
one can distinguish between the two func- 
tion of the number of observed events, for £,xy,xq—t^/4: 
and mi/=200 GeV/c^. The ambiguity between CxW"^ 
-^xp', UqHxT and ^xQ = -£.Tp' would be very 
hard to lift. 

The last J=0 mixed case that we consider has unique 
features; this is the "composite Higgs" in which a term 
oc fc^fcy is present in the HZZ coupling. This case is dif- 
ferent from the previous ones in that a composite scalar 
has well defined J^'-^ =Q^^ , regardless of the value of the 
angle (^xy characterizing the mixing between its pointlike 
and derivative couplings. As a consequence, the angular 
integrals of their interference term do not vanish, and 
there is no symmetry around ^xy=0. All the terms in 
the prf/ having the same discrete symmetries and similar 
angular dependences; there happen to be large cancel- 
lations in the pdf for a 'critical' mj^-dependent value of 
S,XY, as in the example shown in Fig. 44 for the fully 
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FIG. 41: Left: Significance for excluding values of a C- 
violating £^xq 7^ in favor of 0^ (Cxq=0), assumed to be 
correct. Right: Significance for excluding a pure 0"*" in favor 
of CxQ 7^ 0, assumed correct for the ^xQ-values on the i-axis. 
Hypothesis tests are for m/f=145, 200 and 350 GeV/c^ (top, 
middle and bottom), for A'^s=50. 



angular-integrated result. 

The appearance of an order of magnitude enhancement 
of the squared matrix element in Fig. 44 for 0(1) values 
of (,XY can be regarded as an artifact of our choosing 
a rather low mass scale {Mz) in the definition of the 
dimensionless coupling Y in Eq. (9); if e.g. we instead 
chose the compositeness scale at mjif=350 GeV/c^, this 
enhancement would be much smaller. Nevertheless the 
possible enhancement from a nonzero Y coupling, and the 
possible suppression from XY interference, signifies an 
interesting scenario: it is possible to discover an HLL that 
is in fact a 0"^"*" resonance, and is produced by exactly 
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FIG. 42: The median of the significance (coloured 2;-"axis") 
for excluding values of (j/-axis) in favor of the 7^ 
hypothesis assuming as correct the values of the a;-axis. 
The tests are performed for Hi^O^^, Uo=0^'-^ (left) and 
Mi=0^'^, Ho^O^-^ (right); mH = 145, 200 and 350 GeV/c^ 
(top, middle and bottom), for Ns=50. 
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FIG. 43: The significance for excluding the C-violating J=0 
hypothesis in favor of a CP- violating case, assuming the lat- 
ter to be correct, with ^xp,xQ=Tr /4. Example for mfr=200 
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FIG. 44: The fully angularly-integrated matrix element 
squared for a "composite" 0^, showing a strong destructive in- 
terference at a given £,xy- The result, shown here for m^f =350 
GeV/c^, is normalized to ^xy=0- 
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the same pp production processes as a SM Higgs, but for 
which the cross section times branching fraction to ZZ is 
several times higher or several times lower than Standard 
Model expectation. 

We evaluate the significance with which one can dis- 
tinguish between a pointlike and a composite 0"*" using 
the same hypothesis-test approach described earlier for 
the CP-violating scalar case. The results are shown in 
Fig. 45. We observe a non-trivial behavior of the signif- 
icance values at and around the critical ^xy- Interest- 
ingly, the qualitative nature of these cancellations also 
changes with mass. For m//=145 GeV/c^ and mH~200 
GeV/c^, the composite scalar with £^xy near the critical 
point is 0+-like, relative to nearby values of £,xy- For 
TO// =350 GeV/c^, it is very difficult to distinguish be- 
tween the composite and elementary hypotheses, except 
if £,XY is close to critical. Near this critical value the 
significance is greatly improved, because after the large 
cancellations the angular distributions of the pure 0"*" and 
the mixed case no longer resemble each other. 

As we discussed for the C- and CP-violating cases, 
an additional question is whether one can distinguish a 
composite scalar from other mixed scalars. We find that, 
compared to the composite case, the two other mixed 
cases are nearly identical. The results for the distinction 
between the CP-violating and composite cases are shown 
in Fig. 46. For large values of £^xy and ^xp, it is possi- 
ble to distinguish between the two hypotheses at a large 
significance with a mere A^s=50. For m/f=350 GeV/c^, 
the composite scalar is very similar to the pointlike 0^ 
-and cannot be distinguished from it- except if ^xy is 
near its critical point. 

Replacing the CP- violating scalar with the C-violating 
one yields results nearly identical to the ones in Fig. 46. 
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FIG. 45: Left: significance for excluding values of ^xy in fa- 
vor of a pointlike 0^ {^xy=0), assumed to be correct. Right: 
significance for excluding a pointlike 0"'' in favor of a "com- 
posite" one {^xY 7^ 0), assumed correct for the ^xy values 
on the a;-axis, for mH = 145, 200 and 350 GeV/c^ (top, middle 
and bottom) and A''s=50. 



F. 0+ vs. general J=l 

In Sec. VII B we discussed the prospects for distin- 
guishing a 0^ from the two pure J^^ spin-one objects, 
vector and axial-vector. Here, we address a more general 
question: how well can one distinguish between 0^ and 
the general family of J=l states? 

The most general vertex describing the coupling of 
a J=l particle a Z pair can be parametrized, for non- 
vanishing X, P, and Q, as: 
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FIG. 46: The median of the significance (colored-labeled z- 
"axis") for excluding values of S,xp (j/-axis) in favor of the 
composite scalar assuming it to be correct with the ^xy values 
of the X-axis, for mH = 145, 200 and 350 GeV/c^ (top, middle 
and bottom) and A''s=50. 



in terms of two mixing angles ^ and S. 

The mixing between the pure vector and axial cou- 
plings is described by ^, while S parametrizes the mixing 
between the CP- and C- violating parts of the interfer- 
ence term in the matrix element squared. In order to 
quantify the significance at which one can distinguish 
between the 0+ hypothesis and the general J=l case, we 
consider two different types of tests, which answer two 
similar questions. 

Assuming a 0+ resonance to be the correct choice, we 
determine the significance with which can we exclude val- 
ues of ^ and S for a J=l hypothesis. We perform a series 
of simple hypothesis tests, for each set of fixed values f 
and S, between the two hypotheses: the test statistic is 
A = log[£(0"'")/£(f , 6)]. The results, as a function of ^ for 
6=7t/2 and to_h=350 GeV/c^, are shown in Fig. 47. The 
points ^=0 and |^|=7r/2 correspond to the pure vector 
and pure axial-vector limits, respectively, and are consis- 
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FIG. 47: Significance for excluding values of for 5=n/2, in 
the general J=l hypothesis [dubbed l^'-''^*)] in favor of the 
0^ one, assumed to be correct. Results for mH=350 GeV/c^ 
and A*'s=50. The dashed line is the median of the significance. 
The 1 and 2 a bands correspond to 68% and 95% median- 
centered confidence intervals. 



tent with Figs. 22 and 23 on these pure cases. 

Assuming a J=l resonance with given ^ and S to be 
the correct choice, we determine the significance with 
which we can exclude the 0+ case in favor of J— I. We 
have to treat ^ and S as nuisance parameters, since we 
are considering the general J=l case. The statistic is 
log[max£(^, j)/£(0"'")]. The results, as functions of ^ for 
i5=7r/4 and m//=200 GeV/c^, are given in Fig. 48, which 
shows that one can potentially exclude the 0+ hypothesis 
without knowing the actual values of ^ and 6. Prospects 
for measuring these angles are discussed in Sec. VII G. 

In Fig 49 we show the significance for the distinction 
between the 0+ and the general J=l func- 
tion of ^ and S, for mjj=145, 200, and 350 GeV/c^ No- 
tice that the significance levels colour-coded as a z- "axis" 
range over a small interval. This means that the entire 
J=l family is almost "equally dissimilar" to 0+. In gen- 
eral, one's ability to exclude J=l relative to 0+ is greater 
than its opposite, due to the required treatment of f and 
6 as nuisance parameters, although the differences are rel- 
atively small in magnitude and in ^- and (5-dependence. 

The fact that the significance plane as a function of 
^ and S is relatively flat means that, with some mn- 
dependent amount of observed events, one shall be able 
to unambiguously exclude the general J=l hypothesis in 
favor of the O"*" case (assuming it to be correct) or vice- 
versa, regardless of the values of ^ and S. Using the pure 
J-PC" hypothesis test as a guide, we conclude that the 
median expectation for differentiating between 0"*" and 




FIG. 48: Significance for excluding the 0^ hypothesis in favor 
of the general J=l case [dubbed l^*-^'**'], assumed correct for 
^ as in the a;-axis and (5=7r/4. Results for mn—'iOO GeV/c^ 
and A^s=50. The dashed line and bands are as in Fig. 47. 
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FIG. 49: Left: Median of the significance (coloured z-"axis") 
for excluding values of ^ and 5 corresponding to a J=l hy- 
pothesis [dubbed l^'-^f^'''] in favor of 0"'', if the latter is correct. 
Right: vice-versa, with values of ^ and 5 indicated on the axes. 
Results for mH~U5, 200 and 350 GeV/c^ (top, middle and 
bottom), for A''s=50. 
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J=l should exceed 5 a with Ns ~ (60, 200, 85) events for 
TOi/=(145,200,350) GeV/c2, respectively. 

Additionally, based on our results concerning the dis- 
tinction between 0^ and the two pure J=l states, and 
the results on the mixed J—Q hypotheses, we conclude 
that it is equally easy, or even easier, to distinguish be- 
tween J=l and a J—Q state other than 0+. Hence, with 
the numbers of events listed above, it is likely that one 
will be able to unambiguously exclude the J=l family 
of hypotheses in favor of a general J=0 hypothesis, or 
vice-versa, if the resonance is either one or the other. 
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G. Parameter estimation in mixed J 
cases 
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Were one to find out from real data and the hypothesis 
tests discussed in the previous section that a mixed J—Q 
or J—1 state is the preferred description, the next item 
in the context of this analysis would be the measurement 
of its mixing parameters (in a larger context one would 
include at this stage the measurement of decay branching 
ratios). 

We have seen in Sees. VII E and VII F that our hy- 
pothesis tests can demonstrate -if correct- and with com- 
putable significance, that a standard 0"*" particle is disfa- 
vored relative to a mixed scalar or vector with unspecified 
HZZ coupling ratios (or mixing angles). In these tests, 
the angles were treated as nuisance parameters. Their 
measurement proceeds along the same line -the preferred 
value is simply that which maximizes the likelihood- but 
the treatment of confidence intervals need be different. 

More specifically, each mixed hypothesis family is char- 
acterized by mixing angles ^. For each "experiment" , N 
events are simulated, each one characterized by a vec- 
tor Xe — {w, r2, Af^.jle. The likelihood for a particular 
family of hypotheses is £(|) = H^i Pe{xe, 0- The mea- 
sured values of the mixing angles, ^moas, are chosen to be 
those that maximize the likelihood. 

To assign confidence intervals to these measurements 
we use a fully frequentist approach. An ensemble of "ex- 
periments" is performed with fixed input values ^=^input- 
For each experiment, the measured values of ^ are taken 
from the maximization of the likelihood. This procedure 
is repeated for a fine matrix of input values, covering 
the allowed parameter space. From the probability dis- 
tribution functions P(^moas|6nput), estimated using this 
ensemble of experiments, the Feldman-Cousins unified 
approach [48] is used to choose which elements of proba- 
bility are included in confidence intervals. 

As an example, consider the CP-violating scalar case, 
discussed in Sec. VII E. The confidence intervals for mea- 
sured values of ^xp (the mixing parameter that charac- 
terizes this hypothesis) are shown in Fig. 50 for different 
values of toj^. The way to interpret these figures is as 
follows: For a particular set of data -one experiment, 
which in this case includes A'^s=50 observed events- an 
input value of ^xp (to be read on the x-axis) results 
in a measured value to be read (with its error bands) 
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FIG. 50; Confidence intervals for measured values of (^xp for 
a CP-violating J=0 resonance, for m//=145, 200 and 350 
GeV/c^ (top, middle and bottom), all for A^s=50. For mea- 
sured values of £^xp on the y-axis, confidence intervals should 
be read horizontally, see text. 



on the y axis. The confidence intervals are obtained by 
drawing a horizontal line passing through the measured 
S^xp- The overlap of this line with the na bands dic- 
tates which values of "input ^xp" should be included in 
the na confidence intervals. For example, for m/f=200 
GeV/c2 (middle of Fig. 50) we see that, if CxT=^^ the 
3(7 confidence interval is approximately £^xp G !]• 

The I fj bands in Fig. 50 are centered on the diagonal 
^moas_^input^ iniplying that there is no significant bias in 
the measurement. In addition to this, the 2 a and 3cr 
bands also cover most of the diagonal ^^f^ — — Cxp"*- 
This confirms our observation from Sec. VII F that our 
ability to pin down this parameter comes predominantly 
from measuring the relative strengths of the 0^ and 0^ 
parts of the pdf rather than the nature (P-odd) of its in- 
terference term. An increased number of observed events 
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FIG. 51: Confidence intervals for measured values of ^xq for a 
C-violating J=0 resonance for mH=U5, 200 and 350 GeV/c^ 
(top, middle and bottom), all for 7Vg=50. For measured val- 
ues of ^xQ on the y-axis, confidence intervals should be read 
horizontally, see text. 



FIG. 52: Confidence intervals for measured values of (^xy 
for a "composite" J=0 resonance, for m_ff=145, 200 and 350 
GeV/c^ (top, middle and bottom), all for A^s=50. For mea- 
sured values of ^xy on the y-axis, confidence intervals should 
be read horizontally. 



is needed to fully resolve this sign ambiguity. 

In Fig. 50 we see that for m//=145 GeV/c^ (but not for 
mif— 200 GeV/c^) the size of the confidence intervals for 
^xp decreases with increasing |^xp|- This is due to the 
effective coupling strengths of the 0+ and 0~ parts of the 
pd/ differing by a factor of ^ 10 at mH=145 GeV/c^ but 
not at the other masses. Hence, at the lowest mass, only 
at tan^(^xp) ~ 10 does the pd/ exhibit 0+- and 0~-like 
behaviours of similar magnitude. 

Confidence intervals for measurements of the param- 
eter ^xQ for a scalar with C-violating HLL couplings 
are shown in Fig. 51; These are nearly identical to those 
in Fig. 50, reflecting the difficulty of discriminating the 
Cxp 7^ and ^xq hypotheses, as discussed in 
Sec. VII E. For the C-odd case, the sign ambiguity of 
£,x'q^ is slightly worse than for the T-odd one as demon- 



strated by the 1 a confidence bands appearing on the 
^^^a*' = -^^P"* diagonal for to^/=350 GeV/c^. This 
is also expected, since the C-odd interference term is 
proportional to the relatively small number ry « 0.15, 
see Eq. (22). One's ability to distinguish between J—0 
C- and T-odd admixtures relies on the resolution of the 
interference terms. With a factor of 10 more statistics 
[Ns ^ 500), one would be able to resolve the sign am- 
biguity in ^xp and S^xQ and to distinguish between the 
two cases. 

The confidence intervals associated with measurements 
of for a composite scalar are shown in Fig. 52. We 
observe that, for m/f =145 and 200 GeV/c^, the 1 a inter- 
vals are centered on the diagonal ^^y^'* ~ Cxy^* ■ There 
are no bands along ^xy"" ~ ~Cxy^^ ^ since the interfer- 
ence term is of a different nature than that of the discrete- 
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symmetry violating cases. The extensions of the 2 and 3 a 
bands along almost horizontal and vertical lines around 
^XY ^1-3 result from large cancellations in the pdf, dis- 
cussed in Sec. VII E. 

The figure for m/f=350 GeV/c^ is hard to decipher. 
With a magnifier one sees that at the critical value of 
^XY the confidence intervals are tiny. Everywhere else, 
the intervals essentially include all possible values except 
the critical one. This is tantamount to saying that at this 
mass we cannot tell, on the basis of our analysis, a com- 
posite from a pointlike scalar unless is has a particular 
value of £,XY, a fact made clearer by Fig. 45. 

The other mixed case we study is that of a general J—\ 
resonance, parameterized by angles ^ and 5 as described 
in Sec. VII F. We saw in Sec. VII D, that most difficult 
distinction is the one between the two pure J^'-^ spin- 
one resonances, indicating that these two cases are very 
similar. This is what we find again when exploring the 
potential for measuring ^ and 5. 

In Figs. 53 we show as an example the confidence in- 
tervals for measurements of ^ and 5 at m//=145 GeV/c^ 
The ability to resolve the value of the P-mixing angle ^ 
is modest. The measurement of the CP-mixing angle 5 
is still harder. Specifically, we see a large sign ambiguity 
in the measured 5, indicating that, with Ns ~ 50, it is 
difficult to resolve the nature of the interference term, as 
was the case for J—Q. 

Overall, we find that a precise measurement of ^ and 
5 for a J—1 resonance is very difficult. The conclusion of 
this section and Sec. VII F is that, if a new J=l boson is 
found, a modest number of events will suffice to exclude 
J=0, 2 alternatives with high significance. Before many 
more events are gathered, and with only the tools we have 
studied, it is hard to make precise statements about the 
nature of a J=l resonance, other than its spin. 





H = l"^''**' m„ = 145 GeV/c- N'^" = 50 




FIG. 53: Confidence intervals for measured values of ^ and 
5 for a J=l resonance with a mass 145 GeV/c'^ and A^s=50 
events. The input values, indicated by diamonds, are reported 
alongside the figures. 
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VIII. CONCLUSIONS, CAVEATS, AND 
OUTLOOK 

It is no surprise that using all of the decay information 
in a data sample provides better discrimination of the 
identity of a new heavy resonance than examining a sin- 
gle angular distribution or asymmetry. Nevertheless, one 
might be tempted, given a small data set constituting an 
initial discovery, to settle for a stripped-down analysis. 
Our study quantifies the cost, in units of integrated LHC 
luminosity, of pursuing such sub-optimal analysis strate- 
gies, as illustrated in Fig. 54 for the benchmark m^f =200 
GeV/c2. 

In this figure we compare the discrimination between 
the 0"*" and 1^ hypotheses for likelihood definitions that 
exploit different sets of variables, with the notation that 
P(ai,--- ,aAr) denotes N-dimensional pdfs in the cor- 
related variables {ai,-- - jO^v}- Here Yl^PiXi) is con- 
structed from one-dimensional pdfs for all variables, ig- 
noring (erroneously) their correlations. P{lu \ {CI)th) are 
pdfs including the variables uj and their correlations, but 
with the hypothesis 1~ represented by a pdf in which the 
variables 5l={<I>, cos 0} have been integrated out. 

The likelihood P{uj \ {(1)t^h) performs badly even rel- 
ative to P{uj), which uses fewer angular variables. The 
two differ only in that the first construction implicitly as- 
sumes a uniform 4 tt coverage of the observed leptons (an 
assumption customary in the literature) as if the muon 
Pt and rj analysis requirements did not depend on the 

angular variables. The differing results arise from the 
strong correlation between the variables and ((> in the 
J=l pdfs^ such that phase space acceptance sculpting of 
the $ distribution alters the distribution, as discussed 
in Sec.IV and VII B. 

Additionally we find that treating the correlated angu- 
lar variables as uncorrelated, as in the J^^ P{Xi) example 
of Figure 54, not only degrades the discrimination signif- 
icance but also produces a real chance of falsely labeling 
the quantum numbers of the new resonance. Assume for 
example the SM, with mjy=200 GeV/c^. Let the data be 
fit to either a fully correlated pdf oi an uncorrelated one. 
The projections of the corresponding theoretical pdfs, in- 
volving only the variables cos 9i and cos 62 , are illustrated 
in Fig. 55. On the top (bottom) of the figure we show 
P[cos6'i, cos 6*2] (P[cos6'i]xP[cos6'2]). With limited stat- 
ics - insufhcient to distinguish between the correlated 
and uncorrelated distributions ~ the correct conclusion 
will be reached: the data are compatible with the SM. 
But, as the statistics are increased, the data will signifi- 
cantly deviate from the P[cos6'i] x P[cos6'2] distribution, 
and a false rejection of the SM hypothesis would become 
increasingly supported. 

Overall, we have demonstrated that small signal sam- 
ples in the ZZ — >• M or ZZ* — >• M decay channels, as 
might be available at the moment of discovery, could be 
sufficient to characterize a putative Higgs particle. Below 
we summarize these results in more detail. 
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FIG. 54: Median significance for rejecting 1~ in favor of , 
for different likelihood constructions used in the log likelihood 
ratio test statistic. is always considered the true hypoth- 
esis. 




FIG. 55: The normalized theoretical pdfs in the variables 
cos 01 and cos 62 (integrated over (j)) for J=0^ and mH=200 
GeV/c^ Top: the 2D prf/Pi2=P[cos6li, COS6I2]. Bottom: the 
product Pi * P2=P[cos6li] x P[cos6l2] of the ID pdfs. 



A. Summary of pure case discrimination 

Amongst the many comparisons considered in our 
analysis, the ones between simple hypotheses are the 
most readily summarized. This we do in Tables I,II for 
miif=145 GeV /(? for all pure-case comparisons between 
J=0, 1 parent particles, and in Tables III, IV (V,VI) for 
mij=200 (350) GeV/c^, for all pure-case comparisons be- 
tween J=0, 1, 2 parent particles. 
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TABLE I: Minimum number of observed events such that the 
median significance for rejecting Ho in favor of the hypothesis 
Hi (assuming Hi is right) exceeds 3cr with mif=145 GeV/c^. 
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TABLE IL Same as Table I, but requiring that the median 
significance exceeds 5 a. 
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66 
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86 


45 


54 
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TABLE III: Minimum number of observed events such that 
the median significance for rejecting Ho in favor of the hy- 
pothesis Hi (assuming Hi is right) exceeds 3cr with mif=200 
GeV/c^ 

Overall, the discrimination power of the hypothesis 
tests is very impressive. The mH=200 GeV/c^ bench- 
mark example is the one requiring the largest statistics to 
reach a given discrimination at a given level of confidence. 
Compared with the toh=350 GeV/c^ case, this is be- 
cause various coefficients of the angular dependences van- 
ish at the toh=2Mz threshold. The mi/=145 GeV/c^ 
example fares better than the 200 GeV/c^ one for the 
same reason, amplified by the extra lever-arm supplied 
by a non-trivial Mz* distribution. 

The tables also show that the discriminating power be- 
tween two given hypotheses is approximately symmetric 
under the interchange of 'right' and 'wrong'. Telling 1"*" 
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TABLE IV: Same as Table III, but requiring that the median 
significance exceeds 5 a. 



Ho J| Hi ^ 


0+ 


0" 


1" 


1+ 


2+ 


0+ 




8 


21 


24 


11 


0" 


9 




22 


22 


36 


1" 


24 


22 




81 


46 


1+ 


26 


22 


80 




56 


2+ 


15 


39 


55 


73 





TABLE V: Minimum number of observed events such that the 
median significance for rejecting Ho in favor of the hypothesis 
Hi (assuming Hi is right) exceeds 3cr with m/f=350 GeV/c^. 



from 1~ is always difficult but not impossible, a fact of 
relevance for a Z' look-alike analysis. The level of signif- 
icance does not obey a naive N{a) oc y/Ns law. However 
we find by inspection that an approximation of the form 
N{a) = a+b ^/Ns works well, allowing one to extrapolate 
to larger numbers of events than presented here. 

Other lessons from the tables are case-by-case specific, 
reflecting the mass-dependent quantum-mechanical en- 
tanglement between the decay variables. Some examples 
are: distinguishing the 'natural-parity' J=0^ and 1^ hy- 
potheses for mH~14:5 GeV/c^ requires only a dozen sig- 
nal events for 3a discrimination. For 200 GeV/c^, dis- 
criminating 0^ from 0^ is relatively easy, but distinguish- 
ing 0+ from 2+ is difficult. For 350 GeV/c^, contrariwise, 
2+ is relatively easy to disentangle from 0+ , but not from 
0". 



B. Summary of mixed cases, CP and 
compositeness discrimination 

We find that direct sensitivity to CP odd, parity odd 
XP interference eff'ects, or to CP odd, parity even XQ 
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TABLE VI: Same as Table V, but requiring that the median 
significance exceeds 5 a. 
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interference effects, will require signal samples about an 
order of magnitude larger than considered here. We have 
also observed that with much smaller statistics it may be 
possible to conclude that a mix of X and P (or X and Q) 
couplings is favored over just the pure X (i.e. 0+) or pure 
P (i.e. 0~) couplings alone. Such a conclusion would be 
tantamount to demonstrating CP violation in the Higgs 
sector. However this scenario relies on large CP viola- 
tion, and even in this favorable case one cannot tell an X 
and P mixture from an X and Q mixture without more 
data than what is required to establish discovery. 

In the case of a composite Higgs, it may be conceiv- 
able that the Higgs is as 'soft' as a pion, in the sense of 
having an inverse radius and a mass of comparable mag- 
nitude. In this scenario we have seen that the angular 
distributions associated to the X and Y couplings are 
similar after integrating over the decay angles. As a re- 
sult there can be strong destructive interference between 
these contributions. For our lighter mass benchmarks 
we find good discrimination of pure 0'^ from the mixed 
composites. For the heavier mi/=350 GeV/c^ example, 
discrimination based on decay angles is poor unless the 
strong interference effects are present; here we also ob- 
served that substantial enhancement or suppression of 
the HLL^ ZZ branching fraction can provide another 
important discriminator. 

For mixed cases, one could worry that certain combi- 
nations of exotic couplings might let an HLL successfully 
masquerade as a 0+ Higgs, even when all the pure case 
exotics are excluded. For spin 1 HLLs we have shown 
that this does not happen. In fact we find that when we 
have an SM Higgs, the entire family of mixed coupling 
spin 1 HLLs can be excluded at approximately the same 
expected level of significance as for the pure 1~ or 1''" 
cases. An even stronger result is that the general spin 
hypothesis can be conclusively discriminated from the 
general spin 1 hypothesis, at or close to the moment of 
discovery. 



C. Analysis limitations 

In our analysis wc focused on decay information, ex- 
ploiting an approximate factorization between observ- 
ables related to Higgs (or HLL) production and observ- 
ables related to decay. The factorization is only approx- 
imate because of phase space acceptance effects and, in 
the case of spin >0 HLLs, correlations between the ini- 
tial and final state particles. In a real data analysis 
one would want to include production information, which 
in turn would require a detailed knowledge of radiative 
corrections, PDFs, and full detector simulation for the 
HLLs. Such an analysis is beyond the scope of this pa- 
per. Within our narrower scope we have incorporated as 
much as we could all the issues that make a conceptual 
difference in the strategy. When we have made approx- 
imations or have neglected certain effects, it is because 
the detailed inclusion of these effects would not have a 



qualitative impact on our results. 

The QCD corrections to the signal predictions for 
cPa/dpT dr] are large, as is well-studied for the SM Higgs 
(see, for example, [4], [50, 51] and references therein.) The 
impact on the total cross sections is not relevant to our 
analysis, but the corrections to the {pt,ii) distributions 
will modify the phase space acceptance effects on the 
distributions of the final-state leptons. For the SM Higgs 
wc have included these corrections at NLO, and a re- 
cent study shows that the effects of NNLO corrections on 
the final-state lepton distributions are not dramatic [52] . 
We have not included the differences between the phase 
space acceptance effects for the SM Higgs and those for 
the various HLLs, but we performed a comparison at LO 
to see that these differences are small compared to the 
acceptance effects themselves. 

There are electroweak radiative corrections that di- 
rectly involve the final-state leptons. For the SM Higgs 
these corrections have been computed and studied in de- 
tail [36]; the corrections are of the order of 5 to 10% 
and cause a mild distortion of the angular distributions. 
These effects should be included in a complete analysis, 
but they do not introduce anything conceptually new to 
the methodology proposed in this study, and their in- 
clusion involves details of the experimental treatment of 
the vertex and subsequent radiations by electrons and 
muons. 

We only considered the dominant ZZ background, and 
only at LO. It would be useful to include a more compre- 
hensive treatment of the SM backgrounds to the golden 
channel and to use the full signal-to-background discrim- 
ination, e.g. by adding the Z mass distribution to the 
likelihood definition. A complete treatment of the back- 
grounds would require full detector simulation. 

Our treatment of couplings and HLLs was not exhaus- 
tive, since we have ignored gauge invariant operators with 
dimension greater than 6, have only examined one case 
of spin 2 HLL, and have not even mentioned the possi- 
bility of HLLs with spins higher than 2. At some point 
Occam's razor obviates the need for such comparisons: 
"Raffiniert ist der Herr Gott, aber boshaft ist Er nicht", 
to quote a known author [53]. 

The likelihood analyses pursued here are very com- 
puting intensive, since 5<t discrimination implies simu- 
lating sufficiently many pseudo-experiments to fill out 
what amounts to the 5 a tails in multidimensional likeli- 
hood distributions, where they are typically highly non- 
Gaussian. The analysis presented here used more than 
10^'* pseudo-experiments in total. 



D. Outlook 

We have seen that by exploiting the full decay infor- 
mation in the golden channel we should be able to say 
a lot about the identity of a putative Higgs resonance 
around the moment of discovery. Our results also show 
that asymptotically, utilizing the full physics run of the 
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LHC, it should be possible to explore very detailed prop- 
erties of such a resonance. 

It has not escaped our attention that there are many 
processes other than the ZZ decays of a heavy resonance 
whose characterization may benefit from an analysis of 
the kind that we have performed here. 

Note added: 

While this manuscript was in preparation we received 
the preprint [54] , reporting on an analysis similar to what 
we have presented here. 
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otherwise non-standard HLL, dubbed <1> before the spon- 
taneous symmetry breaking. Call V"^^ any of the field 
tensors in Eq. (Al). The operators in this Lagrangian 
could be "descendants" of dimension 6 operators of the 
form $t$y2^ .^j^j^ $ _^ ^ ^ ggg g g j22]. In such a 

case there would be a standard-like 5^1/ coupling plus the 
one induced by the higher-dimensional operators. 

For the case of a spin-1 neutral HLL, iJp, the lowest- 
dimension gauge-invariant Lagrangian generating the 
couplings of Eq. (10) is built of operators of dimension 6: 

A^L = {d^^ + H^^) (Al W^x + A2 B^B^x) 

where Da is the covariant derivative and {M^ = 
MDaN - {Da M)N. 

For a canonical-dimension spin-2 neutral HLL, ff^^i/, 
the lowest-dimension gauge-invariant Lagrangian has 
couplings of dimension 5: 

L=^H^, (Al W"'' + A2 B^B"") 
+ j H^P i e^^adiAs W^"^WP^ + A4 B^^'^BPf') . (A3) 

The consideration of gauge-invariant constructions for 

HLLs with non-trivial electroweak charges would take us 
well beyond the scope of this paper. 



Appendix A: SU(2)l x U(1)y gauge- invariant 
couplings 



Appendix B: Phase space for ZZ* 



To write Lagrangians generating the couplings of 
Sec. Ill A and respecting the electroweak gauge symme- 
try one must specify the electroweak charges of the Higgs 
look-alikes. Consider the example of HLLs that are "neu- 
tral" , i.e. are weak singlets and have zero hypercharge. 
For the scalar case, in a conventional notation for isovec- 
tor and isoscalar gauge fields, the lowest-dimensionality 
Lagrangian density is: 

L = ^ (Al W^a W^^'' + A2 B^aSn 

+ ^Hl e'^"^-(A3 W^aW^, + Ai B^aBar) , (Al) 

with Aj arbitrary constants and A a mass parameter. 
This object generates, amongst others, the couplings of 
Eq. (9). The "true" dimensionality of the operators in 
Eq. (9) is that of the ones appearing in Eq. (Al), that is, 
dimension five. 

The form of Eq. (Al) results in a coupling 
HZ^aZ^"' — 2pi • y»2 5/na — 2fc^fc(j, establishing a re- 
lation between X and Y + iZ in Eq. (9). We do not im- 
pose it, for it is not general even at tree level. Consider, 
for instance, a model with a conventionally-charged but 



In the case in which one of the two Z bosons is off-shell, 
the dependence on its mass (M^. , either mi or 7712) is 
an extra handle in determining the shapes of signal and 
backgrounds. Let Pcms = = J^i 7i /?! = "'^ 72 a'32 

be the momentum of one or the other Z in the H center- 
of-mass system: 



Pcms = 7, ®[mH - {Mz + Mz')\ X 

^m]j-{Mz-Mz'Y ^m%-{Mz + Mz^Y • (Bl) 

Let M. be the matrix element for the process. The ex- 
pectation for the rate of events, including the dependence 
on Mz* , is: 



dN 



rfcos 61 rfcos 62 d(j) dcos 9 dMz* 

2 Mz' Perns 



{Ml,-MlY + Ml, r|' 



(B2) 



with |A^p an explicit function of ci, C2, (j), O, $ and Mz- 
for each specific case to be discussed. 
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Appendix C: General results for spin coupled to 

ZZ* 



In Section III we have already written the angular dis- 
tributions (ir[0+] and (ir[0~] for the pure scalar and pseu- 
doscalar cases, see Eqs. (14), (15). We also discussed 
the T-odd and C-odd interferences between the standard 
coupling -proportional to X in Eq. (9)- and the P and Q 
terms of the same equation. Thus we defined (ir[0, Todd] 
and dr[0, Codd] in Eqs. (21), (22). Similarly we discussed 
the complete result for the 'composite' case with X 
and r 7^ 0, defining dTxY and dVyY in Eqs. (24), (25). 
This allows us to gather the results corresponding to the 
most general deviations from the SM Higgs couplings: 

dr[o] = dr[o+] + {p^ + q2) dr[o-] 

+XPdr[Q, Todd] +XQ(ff[0, Codd] 

+xYdrxY + {Y^ + z'')drYY . (CI) 

To obtain the complete spin result one must add 
to Eq. (CI) the interferences between the non-standard 
terms themselves: 

Adr[0] = XZ dVxz + YPdTYP 

+ YQdTYQ + ZPdTzP + ZQdTzQ, {C2) 

where 

dTxz = 2T]mlml m\ lb (ci + C2) s Si S2 , (C3) 
dTvp = dVzQ = -2mimi-f^ssiS2{ciC2 + jf) , (C4) 
dTyQ = -dVzp = 2rim\m2llc (ci + C2)siS2 • (C5) 



Appendix D: General results for spin 1 coupled to 

ZZ* 



We produce a spin 1 HLL from annihilation of qq with 
quark helicity t/2, t—±\. To an excellent approximation 
the coupling of the HLL to light quarks must conserve 
helicity, so the antiquark has helicity —t/2. Then the 
HLL decays to ZZ (or ZZ*), with Z2 At~/U+ with 
muon helicity (72/2 and Z^ — >■ e~e+ with electron helicity 
ai/2. 

The fully differential cross section is a sum over r, (j\ , 
CT2 of the squared absolute values of the helicity ampli- 
tudes. In addition the (unmeasured) helicities Ai, A2 of 
Zi, Z2 are summed over 0, ±1, before squaring. 

We use the following notation to denote the helicity- 
conserving coupling of a Z boson to a massless fermion 
of helicity (t/2, (t=±1: 



T = ±1: 



1 



{Cy - aCa) . 



(Dl) 



Similarly, we denote the helicity conserving coupling of a 
vector boson HLL to a massless fermion of helicity t/2. 



9r = ^{9v- TQa) • 



(D2) 



In the full matrix element squared, the dependence on 
these vector-fermion-fermion couplings is 



1 
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(cl + cl) {gl+gl) 
-2c^Ca {cl + cl) {gl + gl) (di + CT2) 
-2 {cl + g-oQa T + Aclcl {gl + gl) aia2 

+4c^Ca {cl + cl) g^ga (uiT -I- C72t) 

-^clclgyga CT1CT2T 
from which we derive the shorthand notation 

91 = {cl + cl)\gl + gl) 

9a = -^CyCa {cl + cl) {qI + qI) 

gr = -2 {cl + clf g^ga 
gaa = 4:clcl {gl + gl) 
g„r = 8c„Ca {cl + cl) g^ga 

= _S 2 2 
gacrr — ^CyC^gyga • 



(D3) 



(D4) 



We allow both Z bosons to be off-shell, with invariant 
masses mi and TO2. Some useful mass combinations are 



Ml 



"'2 I 

3m? 



(D5) 



'2 ) 



H 



3to: 



2 ) 



M^ = m%- 2{m\ + ml) 



Ml 



mjj — {mi + 7712) . 



One of the advantages of using helicity amplitudes is 

that we can keep track of which contributions come from 
the longitudinal polarization of the HLL rather than 
the transverse polarizations. We use the notation i"^, 
Iq to flag the parts of the squared matrix element that 
come from the transverse, longitudinal polarizations of 
the HLL, and i£o to flag contributions from the interfer- 
ence. 

We define 6 to be the polar angle of the incoming 

quark with respect to the z-axis defined by Z2 in the 
HLL rest frame. This raises a problem since at a pp col- 
lider we cannot distinguish the quark direction from the 
anitquark direction in a gg-initiated process. A solution 
is to symmetrize the cross section between the case where 
O is the polar angle of the quark direction and the case 
where Q is the polar angle of the antiquark. In the cou- 
pling notation defined in (D4), this symmetrization has 
the the same effect as setting gr, gar, and ga-a-r to zero. 

The standard convention in the literature for the three 
azimuthal angles is somewhat peculiar. The coordinate 
axes are chosen such that the outgoing muon moves along 
the y-axis in the rest frame of the HLL (or equivalently 
of Z2). Thus the azimuthal angle of the muon is w/2, 
while the azimuthal angle of the outgoing electron is de- 
noted (p — tt/2. We denote the azimuthal angle of the 
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incoming quark by This choice of conventions leads 
to rather awkward expressions for the angular distribu- 
tions. A better choice is to align the axes such that the 
quark azimuthal angle $ = 0. The remaining azimuthal 
dependence is then denoted by (fi and (^2, such that the 
substitutions ipi ^ ^ + <p, (fi2 ^ ^ regain the previ- 



ous convention. We will employ this notation in this 
appendix, which makes the formulae more symmetrical. 

After the quark-antiquark symmetrization described 
above, the XX part of the full matrix element squared 
is given by 



^m\ra\X'^-il 51525151 (2^^m^ - ^^m^^ [m? cos(2(^i) + cos(2(^2)]) (D6) 
+gii'^m^{l + C^) [2m|si + 2m\s\ — (mj + 7712)5182] + 'li£ogimH'rri^ C S [miCiSiS2 sirnpx — m2C2S2s\ sin</?2] 
-2^2m|^mim2SiS2 ((1 + C'^){giCiC2 - g^^) cos((/?i - (^2) + S'^{giCiC2 + g^a) cos((pi + ip2)) ■ 



The PP part is given by 



'^gim]jS'^s\sl [M^mX cos(29?i) + M^ml cos(2(/?2)] 

+8£lmlmlmjS^ [gi (cj + cl + sfsj sm{(pi - (^2)^) + 2gaaCiC2] 
+(1 + C^)fgim% [2Mfmlsl + 2M|m?si - {M^mi + Mtml)slsl] 
-8UomHTnlmim2C S [M|miS2 (51C2S1 sint^i cos((^i - (^2) + ci{giCiC2 + gaa) sin (^2) 
-Mfm2Si {gicisl sin (^2 cos(<^i - (^2) + 02(310102 + g^a) siny^i)] 

+2^2m^Mf M|mim2SiS2 [(1 + C^)(5'iCiC2 - g^a) cos(<^i - (^2) - S'^{giCiC2 + gaa) cos((/9i + (^2)] 
The XP and XQ interference parts are given by 



(D7) 



4mim2XP'yb d^gi'm^jS'^ slsl{M'lml sin(2(^2) - M^ml sin(2<^i)) 

+2£iogimHrn^C S [m2SiC2S2(2mi simpi sin(i^i — (P2) — cos 1^2) 

— TOi,S2CiSi(2m2 sin 1^2 sin((^i — 932) +M| cos (^i)] 
-2mim2SiS2 [(1 + C^)£^nijjM^{giCiC2 - gaa) sm{(pi - (^2) 

+TO^s2(.giCiC2 + gaa)i£^rnjj sin((^i + (P2) + '^£l'rnl ain{(pi - 1P2))] 

-4:££omHmim2ml C S [m2Si (31 ci + gaaC2) cos (pi + miS2 (51 C2 + gaaCi) cos (^2] 



(D8) 



'imim2XQ 7(, ££QgamH'rn^C S ^m,2S^S2 (2m^ cos((^i — (^2) sin ipi — Mf sin (^2) 

—771152*1 (2m2 cos((^i — ip2) sin 992 — M2 sin 1^1)^ 
+£'^gamjf{l + c^){M^mlslc2 + M^m^slci) + mim2SiS2 [(1 + C'^)£^ gamjjm'^{ci - C2) cos((^i - (^2) 

-gas^ici + C2){£'^'m}jMl cos(93i + (^2) + 2£om^ cos((pi - (^2))] 
+2££QgamH'm^mim2C S (l + CiC2)(m2Si sintpi — miS2 sin (^2) 



(D9) 



Without the quark-antiquark symmetrization, one adds: 
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SmHTTiim'^X'^^^g^T i'^niHC (m2C2Si — mlciS2 — mim2(ci — C2)siS2 cos(<y?i — (^2)) (DIO) 

—Uom^S {m2SiS2 sin(p2 + miS2Si sin ipi) 
+2mHP^gcTT fmnC {NIim\c2s\ - M2m\cisl + M|mim2(ci - C2)siS2 cos((^i - (^2)) 
+2££o"^i'7i2TO^'S' ((1 + cxC2){Mlm2Si sin 951 + M|miS2 sin(p2) 

— cos(</?i — (/?2)(Af|TOis^S2 sini^i + Mlm2s\s\ sin (^2)) 

-4mHmim2X P jhgar 2£^mijM|mim2C (ci - C2)siS2 sin((^i - (^2) 

+££0171^8 [m2sls2{M^ cos<p2 — 2mf siny>i sin((pi — 1^2)) — miS2Si{M^ cos<^i + 2m2 sin 1^2 sin(<^i — (^2)] 
+2mim2(l + ciC2)(m2Si cos - miS2 cos 932)) 
AmHmim2XQ^b 2£'^grmHC {2Mfmlsl - 2M2m\sl + Mlm\s\sl) 

-AfmHmlm-im2c{g^„r - 5rCiC2)siS2 cos(<^i - (^92) 
+2^fo5r'Ti^<S' [m2C2S^S2(2m^ cos(<^i — (^2) sintpi — M| sin 1^2) + miCiS2Si(2m2 cos(<^i — (^2) sin((^2) — Aff sinc^i)] 

+4^4mim2m^5 {m2Si{g„^rC2 + 9tCi) simpi + miS2(5<r<7rCi + firrC2) siny-j) 



In the limit that both Z's are on-shell, mi=m2=Mz, ma -> 0, Mi=M2=M3 Amjjy'^ / x"^ , M4 m'jj{x'^ - 
we introduce the notation of Buszello et al.: x = 2)/x^, and 7b xy. For the full symmetrized matrix 
itih/Mz, = {x^ — 4)/4. Then we can simplify using element squared the result is: 



_/2 8 2 



2{x^X^ + {x^ - 4)P2) + C'2)(l - clcj) - SHg,„ + .9iCiC2)siS2 cos(^i + ^2)] 
-{x^X^ - - 4)p2) cos(2<^i) + cos(2<^2)) - 2(1 + C^)(g„a ~ 5iCiC2)siS2 cos(^i - (^2)] 

-AXPxysiS2[giS'^siS2{s\n{2Lpi) - sin(2y72)) - 2(1 + C'^){g^a - giCiC2) sin((/3i - ^32)] 

■^4.XPxyg„[{l + C^){c2sl + cisl)-S'^{ci+C2)siS2COs{ipi+ip2)\\ ■ (Dll) 
If we simply set 6 = 0, the above simplifies to: 



16 



l^'^mW [giix'^X'' + - 4)p2)(i _ c^c^) + [x'^X'' - (a;^ - A)P''){g,, - 5iCiC2)siS2 cos(^i - (^2) 
+AXPxysiS2{gaa -5iCiC2)sin(<^i - ip2) + 2XPxyga{c2sl + Cisl) 
This agrees with the result of Buszello et al. [29]. 



(D12) 
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